GENES AND POLYMORPHISMS ASSOCIATED WITH
CARDIOVASCULAR DISEASE AND THEIR USE 

RELATED APPLICATIONS 

Benefit of priority is claimed to U.S. application Serial No.
09/802,640, entitled "GENES AND POLYMORPHISMS ASSOCIATED
WITH CARDIOVASCULAR DISEASE AND THEIR USE", filed on March 9,
2001 by Andreas Braun, Aruna Bansal, and Patrick W. Kleyn. Where
permitted, the subject matter of this application is incorporated by
reference in its entirety.

FIELD OF THE INVENTION

The field of the invention involves genes and polymorphisms of 
these genes that are associated with development of cardiovascular 
disease. Methods that use polymorphic markers for prognosticating, 
profiling drug response and drug discovery are provided. 

BACKGROUND OF THE INVENTION

Diseases in all organisms have a genetic component, whether
inherited or resulting from the body's response to environmental stresses,
such as viruses and toxins. The ultimate goal of ongoing genomic
research is to use this information to develop new ways to identify, treat
and potentially cure these diseases. The first step has been to screen
disease tissue and identify genomic changes at the level of individual
samples. The identification of these "disease" markers has then fueled
the development and commercialization of diagnostic tests that detect
these errant genes or polymorphisms. With the increasing numbers of
genetic markers, including single nucleotide polymorphisms (SNPs),
microsatellites, tandem repeats, newly mapped introns and exons, the
challenge to the medical and pharmaceutical communities is to identify
genotypes that not only identify the disease but also follow the
progression of the disease and are predictive of an organism's response
to treatment.

Polymorphisms 

Polymorphisms have been known since 1901 with the identification
of blood types. In the 1950's they were identified on the level of proteins
using large population genetic studies. In the 1980's and 1990's many
of the known protein polymorphisms were correlated with genetic loci on
genomic DNA. For example, the gene dose of the apolipoprotein E type 4
allele was correlated with the risk of Alzheimer's disease in late onset
families (see, e.g., Corder et al. (1993) Science 261: 921-923); mutation
in blood coagulation factor V was associated with resistance to activated
protein C (see, e.g., Bertina et al. (1994) Nature 369:64-67); resistance
to HIV-1 infection has been shown in Caucasian individuals bearing
mutant alleles of the CCR-5 chemokine receptor gene (see,
e.g., Samson et al. (1996) Nature 382:722-725); and a hypermutable
tract in the adenomatous polyposis coli (APC) gene has been
identified in familial colorectal cancer in individuals of Ashkenazi Jewish
background (see, e.g., Laken et al. (1997) Nature Genet. 17:79-83).
There may be more than three million polymorphic sites in the human
genome. Many have been identified, but not yet characterized or mapped
or associated with a disease. Polymorphisms of the genome can lead to
altered gene function, protein function or mRNA instability. To identify
those polymorphisms that have clinical relevance is the goal of a
worldwide scientific effort. Discovery of such polymorphisms will have a
fundamental impact on the identification and development of diagnostics
and drug discovery.

Single nucleotide polymorphisms (SNPs) 

Much of the focus of genomics has been in the identification of
SNPs, which are important for a variety of reasons. They allow indirect
testing (association of haplotypes) and direct testing (functional variants).
They are the most abundant and stable genetic markers. Common
diseases are best explained by common genetic alterations, and the
natural variation in the human population aids in understanding disease,
therapy and environmental interactions.

The organization of SNPs in the primary sequence of a gene into
one of the limited number of combinations that exist as units of
inheritance is termed a haplotype. Each haplotype therefore contains
significantly more information than individual unorganized polymorphisms
and provides an accurate measurement of the genomic variation in the
two chromosomes of an individual. While it is well-established that many
diseases are associated with specific variation in gene sequences and
there are examples in which individual polymorphisms act as genetic
markers for a particular phenotype, in other cases an individual
polymorphism may be found in a variety of genomic backgrounds and
therefore shows no definitive coupling between the polymorphism and the
phenotype. In these instances, the observed haplotype and its frequency
of occurrence in various genotypes will provide a better genetic marker
for the phenotype.
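
By way of illustration only, the notion of a haplotype as an ordered
combination of SNP alleles on one chromosome, and of its frequency of
occurrence, can be sketched as follows. The function name, the string
encoding of a haplotype and the assumption that each individual's two
chromosomes have already been phased are hypothetical choices made for
this sketch and are not part of the methods described herein.

```python
from collections import Counter

def haplotype_frequencies(phased_individuals):
    """Return the relative frequency of each haplotype.

    phased_individuals: list of (hap_a, hap_b) pairs, one pair per
    individual. Each haplotype is a string of alleles read at the same
    ordered SNP sites, e.g. "TA" for T at site 1 and A at site 2.
    """
    counts = Counter()
    for hap_a, hap_b in phased_individuals:
        # Each individual contributes two chromosomes to the count.
        counts[hap_a] += 1
        counts[hap_b] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no haplotypes supplied")
    return {hap: n / total for hap, n in counts.items()}
```

In this sketch, two individuals carrying the hypothetical pairs
("TA", "CG") and ("TA", "TA") would give the haplotype "TA" a frequency
of 0.75 and "CG" a frequency of 0.25.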

Although risk factors for the development of cardiovascular disease
are known, such as high serum cholesterol levels and low serum high
density lipoprotein (HDL) levels, the genetic basis for the manifestation of
these phenotypes remains unknown. An understanding of the genes that
are responsible for controlling cholesterol and HDL levels, along with
useful genetic markers and mutations in these genes that affect these
phenotypes, will allow for detection of a predisposition for these risk
factors and/or cardiovascular disease and the development of therapeutics
to modulate such alterations.


WO 02/072604


PCT/US02/06728 


Therefore, among the objects herein, it is an object herein to 
provide methods and products for detection of a predisposition for these 
risk factors and/or cardiovascular disease.

SUMMARY OF THE INVENTION

Provided herein are methods for using polymorphic markers to
detect a predisposition to the manifestation of high serum cholesterol, low
serum HDL and cardiovascular disease. The ultimate goals are
elucidating pathological pathways, developing new diagnostic assays,
determining genetic profiles for positive responses to therapeutic drugs,
identifying new potential drug targets and identifying new drug
candidates.

A database of twins was screened for individuals that exhibit high
or low levels of serum cholesterol or HDL. Using a full genome scanning
approach, SNPs present in DNA samples from these individuals were
examined for alleles that associate with either high levels of cholesterol or
low levels of HDL. This led to the discovery of the association of the
cytochrome C oxidase subunit VIb (COX6B) gene and the
N-acetylglucosaminyl transferase component glycosylphosphatidylinositol-1
(referred to herein as GPI-1) gene with these risk factors for developing
cardiovascular disease. Specifically, a previously undetermined
association of an allelic variant at nucleotide 86 of the COX6B gene and
high serum cholesterol levels has been discovered. In addition, it has
been discovered that an allelic variant at nucleotide 2577 of the GPI-1
gene is associated with low serum HDL levels. There was no previously
known association between these two genes and risk factors related to
cardiovascular disease.
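
As a minimal sketch of the arithmetic underlying such a comparison, the
frequency of an allele in a group can be computed from genotype counts
by the standard relation (2 x homozygous carriers + heterozygous
carriers) / (2 x individuals). The function and parameter names below
are hypothetical, and the sketch does not reproduce the statistical
analysis actually used to establish the associations described herein.

```python
def allele_frequency(n_hom_variant, n_het, n_hom_reference):
    """Frequency of the variant allele in a group of diploid subjects.

    n_hom_variant:   subjects carrying two copies of the variant allele
    n_het:           subjects carrying one copy
    n_hom_reference: subjects carrying no copy
    """
    n_subjects = n_hom_variant + n_het + n_hom_reference
    if n_subjects == 0:
        raise ValueError("group contains no subjects")
    # Every subject contributes two alleles to the denominator.
    return (2 * n_hom_variant + n_het) / (2 * n_subjects)
```

For example, a hypothetical group of 100 subjects with 10 homozygous
variant, 20 heterozygous and 70 homozygous reference genotypes has a
variant allele frequency of 40/200 = 0.2. Comparing this value between
a high-cholesterol group and a low-cholesterol group is the kind of
contrast on which an association would be assessed.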

Methods are provided for detecting the presence or absence of at
least one allelic variant associated with high cholesterol, low HDL and/or
cardiovascular disease by detecting the presence or absence of at least
one allelic variant of the COX6B gene or the GPI-1 gene, individually or in
combination with one or more allelic variants of other genes associated
with cardiovascular disease.

Also provided are methods for indicating a predisposition to
manifesting high serum cholesterol, low serum HDL and/or cardiovascular
disease based on detecting the presence or absence of at least one allelic
variant of the COX6B or GPI-1 genes, alone or in combination with one or
more allelic variants of other genes associated with cardiovascular
disease. These methods, referred to as haplotyping, are based on
assaying more than one polymorphism of the COX6B and/or GPI-1 genes.
One or more polymorphisms of other genes associated with
cardiovascular disease may also be assayed at the same time. A
collection of allelic variants of one or more genes may be more
informative than a single allelic variant of any one gene. A single
polymorphism of a collection of polymorphisms present in the COX6B
and/or GPI-1 genes and in other genes associated with cardiovascular
disease may be assayed individually or the collection may be assayed
simultaneously using a multiplex assay method.

Also provided are microarrays that include a probe selected from
among an oligonucleotide complementary to a polymorphic region
surrounding position 86 of the sense strand of the COX6B gene coding
sequence; an oligonucleotide complementary to a polymorphic region
surrounding the position of the antisense strand of COX6B corresponding
to position 86 of the sense strand of the COX6B gene coding sequence;
an oligonucleotide complementary to a polymorphic region surrounding
position 2577 of the sense strand of the GPI-1 gene; and an
oligonucleotide complementary to a polymorphic region surrounding the
position of the antisense strand of GPI-1 corresponding to position 2577
of the sense strand of the GPI-1 gene. Microarrays are well known and
can be made, for example, using methods set forth in U.S. Patent Nos.
5,837,832; 5,858,659; 6,043,136; 6,043,031 and 6,156,501.

Further provided are methods of using allelic variants of the COX6B
or GPI-1 gene individually or together with one or more allelic variants of
other genes associated with cardiovascular disease to predict a subject's
response to a biologically active agent that modulates serum cholesterol
or serum HDL, or to a cardiovascular drug.

Also provided are methods to screen candidate biologically active
agents for modulation of cholesterol, HDL or other factors associated with
cardiovascular disease. These methods use cells or transgenic animals
containing one or more allelic variants of the COX6B gene and/or the
GPI-1 gene alone or in combination with allelic variants of one or more
other genes associated with cardiovascular disease. Such animals should
exhibit high cholesterol, low HDL or other known phenotypes associated
with cardiovascular disease. Also provided are methods to construct
transgenic animals that are useful as models for cardiovascular disease by
using one or more allelic variants of the COX6B gene and/or the GPI-1
gene alone or in combination with allelic variants of one or more other
genes associated with cardiovascular disease.

Further provided are combinations of probes and primers and kits
for predicting a predisposition to high serum cholesterol, low HDL levels
and/or cardiovascular disease. In particular, combinations and kits
contain probes or primers that are capable of hybridizing adjacent to or at
polymorphic regions of the COX6B and/or GPI-1 gene. The combinations
and kits can also contain probes or primers that are capable of hybridizing
adjacent to or at polymorphic regions of other genes associated with
cardiovascular disease. The kits also optionally contain instructions for
carrying out assays, interpreting results and for aiding in diagnosing a
subject as having a predisposition towards developing high serum
cholesterol, low HDL levels and/or cardiovascular disease. Combinations
and kits are also provided for predicting a subject's response to a
therapeutic agent directed toward modulating cholesterol, HDL, or another
phenotype associated with cardiovascular disease. Such combinations
and kits contain probes or primers as described above.

In particular for the methods, combinations, kits and arrays
described above, the polymorphisms are SNPs. The detection or
identification is of a T nucleotide at position 86 of the sense strand of the
COX6B gene coding sequence or the detection or identification of an A
nucleotide at the corresponding position in the antisense strand of the
COX6B gene coding sequence. Also embodied is the detection or
identification of an A nucleotide at position 2577 of the sense strand of
the GPI-1 gene or the detection or identification of a T nucleotide at the
corresponding position in the antisense strand of the GPI-1 gene. In
addition to the SNPs discussed above, other polymorphisms of the
COX6B and GPI-1 genes can be assayed for association with high
cholesterol or low HDL, respectively, and used as disclosed above.

Other genes containing allelic variants associated with high serum
cholesterol, low HDL and/or cardiovascular disease, include, but are not
limited to: cholesterol ester transfer protein, plasma (CETP);
apolipoprotein A-IV (APO A4); apolipoprotein A-I (APO A1); apolipoprotein
E (APO E); apolipoprotein B (APO B); apolipoprotein C-III (APO C3); a
gene encoding lipoprotein lipase (LPL); ATP-binding cassette transporter
(ABC 1); paraoxonase 1 (PON 1); paraoxonase 2 (PON 2);
5,10-methylenetetrahydrofolate reductase (MTHFR); a gene encoding hepatic
lipase; E-selectin; G protein beta 3 subunit; and angiotensin II type 1
receptor gene.

The detection of the presence or absence of an allelic variant can
use methods such as, but not limited to, allele specific hybridization,
primer specific extension, oligonucleotide ligation assay, restriction
enzyme site analysis and single-stranded conformation polymorphism
analysis.

In particular, primers used in primer specific extension hybridize
adjacent to nucleotide 86 of the COX6B gene or nucleotide 2577 of the
GPI-1 gene or the corresponding positions on the antisense strand
(numbers refer to GenBank sequences, see pages 15-17). A primer can
be extended in the presence of at least one dideoxynucleotide, particularly
ddG, or two dideoxynucleotides, particularly ddG and ddC. Typically,
detection of extension products is by mass spectrometry. Detection of
allelic variants can also involve signal moieties such as radioisotopes,
enzymes, antigens, antibodies, spectrophotometric reagents,
chemiluminescent reagents, fluorescent reagents and other light
producing reagents.

Other probes and primers useful for the detection of allelic variants
include those that hybridize at or adjacent to the SNPs described in Tables
1-3 and specifically those that include SEQ ID NOs.: 5, 10, 43, 48, 53,
58, 63, 68, 73, 78, 83, 88, 93, 98, 103, 108, 113, and 118.

DESCRIPTION OF THE DRAWINGS

Figure 1 depicts the allelic frequency and genotype for pools and
individually determined samples of blood from individuals having low
cholesterol levels and those with high cholesterol levels.

Figure 2 depicts the allelic frequency and genotype for pools and
individually determined samples of blood from individuals having high HDL
levels and those with low HDL levels.

DETAILED DESCRIPTION
A. Definitions 

Unless defined otherwise, all technical and scientific terms used
herein have the same meaning as is commonly understood by one of skill
in the art to which this invention belongs. All patents, patent applications
and publications referred to throughout the disclosure herein are, unless
noted otherwise, incorporated by reference in their entirety. In the event
that there are a plurality of definitions for terms herein, those in this
section prevail.

As used herein, sequencing refers to the process of determining a
nucleotide sequence and can be performed using any method known to
those of skill in the art. For example, if a polymorphism is identified or
known, and it is desired to assess its frequency or presence in nucleic
acid samples taken from the subjects of the database, the region of
interest from the samples can be isolated, such as by PCR or restriction
fragments, hybridization or other suitable method known to those of skill
in the art, and sequenced. For purposes herein, sequencing analysis, for
example, can be effected using mass spectrometry (see, e.g., U.S. Patent
Nos. 5,547,835, 5,622,824, 5,851,765, and 5,928,906). Nucleic acids
also can be sequenced by hybridization (see, e.g., U.S. Patent Nos.
5,503,980, 5,631,134, 5,795,714), including analysis by mass
spectrometry (see, U.S. Application Serial Nos. 08/419,994 and
09/395,409). Alternatively, sequencing may be performed using other
known methods, such as set forth in U.S. Patent Nos. 5,525,464;
5,695,940; 5,834,189; 5,869,242; 5,876,934; 5,908,755; 5,912,118;
5,952,174; 5,976,802; 5,981,186; 5,998,143; 6,004,744; 6,017,702;
6,018,041; 6,025,136; 6,046,005; 6,087,095; 6,117,634; 6,013,431;
WO 98/30883; WO 98/56954; WO 99/09218; WO 00/58519, and
others.

As used herein, "polymorphism" refers to the coexistence of more
than one form of a gene or portion thereof. A portion of a gene of which
there are at least two different forms, i.e., two different nucleotide
sequences, is referred to as a "polymorphic region of a gene". A
polymorphic region can be a single nucleotide, the identity of which
differs in different alleles. A polymorphic region also can be several
nucleotides in length.

As used herein, "polymorphic gene" refers to a gene having at least 
one polymorphic region. 

As used herein, "allele", which is used interchangeably herein with
"allelic variant", refers to alternative forms of a gene or portions thereof.
Alleles occupy the same locus or position on homologous chromosomes.
When a subject has two identical alleles of a gene, the subject is said to
be homozygous for the gene or allele. When a subject has two different
alleles of a gene, the subject is said to be heterozygous for the gene.
Alleles of a specific gene can differ from each other in a single nucleotide,
or several nucleotides, and can include substitutions, deletions, and
insertions of nucleotides. An allele of a gene also can be a form of a gene
containing a mutation.
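
The homozygous/heterozygous distinction defined above reduces to a
comparison of the two alleles a subject carries at a locus. The sketch
below is illustrative only; the function name and the representation of
an allele as a string are assumptions made for the example.

```python
def zygosity(allele_1, allele_2):
    """Classify a diploid genotype at a single locus.

    Alleles are compared after trimming whitespace and ignoring case,
    so "t" and "T " are treated as the same allele.
    """
    a = allele_1.strip().upper()
    b = allele_2.strip().upper()
    if not a or not b:
        raise ValueError("both alleles must be non-empty")
    return "homozygous" if a == b else "heterozygous"
```

Under these assumptions, a subject carrying T on both chromosomes at a
SNP is classified as homozygous, and a subject carrying T and C is
classified as heterozygous.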

As used herein, the term "subject" refers to mammals and in
particular human beings.

As used herein, the term "gene" or "recombinant gene" refers to a
nucleic acid molecule comprising an open reading frame and including at
least one exon and (optionally) at least one intron sequence. A gene can
be either RNA or DNA. Genes may include regions preceding and
following the coding region (leader and trailer).

As used herein, "intron" refers to a DNA sequence present in a
given gene that is spliced out during mRNA maturation.

As used herein, the term "coding sequence" refers to that portion
of a gene that encodes an amino acid sequence of a protein.

As used herein, the term "sense strand" refers to that strand of a
double-stranded nucleic acid molecule that encodes the sequence of the
mRNA that encodes the amino acid sequence encoded by the
double-stranded nucleic acid molecule.

As used herein, the term "antisense strand" refers to that strand of
a double-stranded nucleic acid molecule that is the complement of the
sequence of the mRNA that encodes the amino acid sequence encoded
by the double-stranded nucleic acid molecule.
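
The relationship between a position on the sense strand and the
corresponding position on the antisense strand follows from Watson-Crick
base pairing (A with T, G with C). The following sketch is illustrative
only: the function names are hypothetical, positions are assumed to be
1-based, and the antisense strand is assumed to be written 5' to 3',
i.e., as the reverse complement of the sense strand.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

def complement_base(base):
    """Return the Watson-Crick partner of a single DNA base."""
    return COMPLEMENT[base.upper()]

def antisense_strand(sense):
    """Return the antisense strand, written 5' to 3'."""
    return "".join(complement_base(b) for b in reversed(sense))

def antisense_position(sense_position, length):
    """Map a 1-based sense position to the 1-based position of its
    paired base on the antisense strand written 5' to 3'."""
    if not 1 <= sense_position <= length:
        raise ValueError("position outside the sequence")
    return length - sense_position + 1
```

Thus a T on the sense strand pairs with an A at the corresponding
position of the antisense strand, and an A on the sense strand pairs
with a T, whichever numbering convention is adopted for the antisense
strand.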

As used herein, a DNA or nucleic acid homolog refers to a nucleic
acid that includes a preselected conserved nucleotide sequence. By the
term "substantially homologous" is meant having at least 80%, preferably
at least 90%, most preferably at least 95% homology therewith or a lesser
percentage of homology or identity and conserved biological activity or
function.

Regarding hybridization, as used herein, stringency conditions to
achieve specific hybridization refer to the washing conditions for
removing the non-specific probes or primers and conditions that are
equivalent to either high, medium, or low stringency as described below:

1) high stringency: 0.1 x SSPE, 0.1% SDS, 65°C

2) medium stringency: 0.2 x SSPE, 0.1% SDS, 50°C

3) low stringency: 1.0 x SSPE, 0.1% SDS, 50°C.

It is understood that equivalent stringencies may be achieved using
alternative buffers, salts and temperatures.
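
For convenience, the three wash conditions listed above can be held in
a simple lookup table, for example when recording assay parameters. The
sketch below is illustrative; the class and field names are hypothetical
and only the numerical values are taken from the list above.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class WashCondition:
    sspe_x: float       # SSPE concentration, as a multiple of 1 x SSPE
    sds_percent: float  # SDS concentration in percent
    temp_c: int         # wash temperature in degrees Celsius

STRINGENCY = {
    "high": WashCondition(sspe_x=0.1, sds_percent=0.1, temp_c=65),
    "medium": WashCondition(sspe_x=0.2, sds_percent=0.1, temp_c=50),
    "low": WashCondition(sspe_x=1.0, sds_percent=0.1, temp_c=50),
}

def describe(level):
    """Format the wash condition for a stringency level as text."""
    c = STRINGENCY[level]
    return f"{c.sspe_x} x SSPE, {c.sds_percent}% SDS, {c.temp_c}°C"
```

In this table, lower salt (SSPE) concentration and higher temperature
correspond to higher stringency, consistent with the ordering given
above.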

As used herein, "heterologous DNA" is DNA that encodes RNA and
proteins that are not normally produced in vivo by the cell in which it is
expressed or that mediates or encodes mediators that alter expression of
endogenous DNA by affecting transcription, translation, or other
regulatable biochemical processes or is not present in the exact
orientation or position as the counterpart DNA in a wildtype cell.
Heterologous DNA may also be referred to as foreign DNA. Any DNA
that one of skill in the art would recognize or consider as heterologous or
foreign to the cell in which it is expressed is herein encompassed by
heterologous DNA. Examples of heterologous DNA include, but are not
limited to, DNA that encodes traceable marker proteins, such as a protein
that confers drug resistance, DNA that encodes therapeutically effective
substances, such as anti-cancer agents, enzymes and hormones, and
DNA that encodes other types of proteins, such as antibodies. Antibodies
that are encoded by heterologous DNA may be secreted or expressed on
the surface of the cell in which the heterologous DNA has been
introduced.

As used herein, a "promoter region" refers to the portion of DNA of
a gene that controls transcription of the DNA to which it is operatively
linked. The promoter region includes specific sequences of DNA that are
sufficient for RNA polymerase recognition, binding and transcription
initiation. This portion of the promoter region is referred to as the
promoter. In addition, the promoter region includes sequences that
modulate this recognition, binding and transcription initiation activity of
the RNA polymerase. These sequences may be cis acting or may be
responsive to trans acting factors. Promoters, depending upon the nature
of the regulation, may be constitutive or regulated.

As used herein, the phrase "operatively linked" generally means the
sequences or segments have been covalently joined into one piece of
DNA, whether in single or double stranded form, whereby control or
regulatory sequences on one segment control or permit expression or
replication or other such control of other segments. The two segments
are not necessarily contiguous. For gene expression a DNA sequence and
a regulatory sequence(s) are connected in such a way to control or permit
gene expression when the appropriate molecules, e.g., transcriptional
activator proteins, are bound to the regulatory sequence(s).

As used herein, the term "vector" refers to a nucleic acid molecule
capable of transporting another nucleic acid to which it has been linked.
One exemplary type of vector is an episome, i.e., a nucleic acid capable
of extra-chromosomal replication. Exemplary vectors include those
capable of autonomous replication and/or expression of nucleic acids to
which they are linked. Vectors capable of directing the expression of
genes to which they are operatively linked are referred to herein as
"expression vectors". In general, expression vectors of utility in
recombinant DNA techniques are often in the form of "plasmids" that
refer generally to circular double stranded DNA loops which, in their
vector form, are not bound to the chromosome. "Plasmid" and "vector"
are used interchangeably as the plasmid is the most commonly used form
of vector. Also included are other forms of expression vectors that serve
equivalent functions and that become known in the art subsequently
hereto.

As used herein, "indicating" or "determining" means that the
presence or absence of an allelic variant may be one of many factors that
are considered when a subject's predisposition to a disease or disorder is
evaluated. Thus a predisposition to a disease or disorder is not
necessarily conclusively determined by only ascertaining the presence or
absence of one or more allelic variants, but the presence of one or more
of such variants is among a number of factors considered.

As used herein, "predisposition to develop a disease or disorder"
means that a subject having a particular genotype and/or haplotype has a
higher likelihood than one not having such a genotype and/or haplotype
of developing a particular disease or disorder.

As used herein, "transgenic animal" refers to any animal, generally
a non-human animal, e.g., a mammal, bird or an amphibian, in which one
or more of the cells of the animal contain heterologous nucleic acid
introduced by way of human intervention, such as by transgenic
techniques well known in the art. The nucleic acid is introduced into the
cell, directly or indirectly by introduction into a precursor of the cell, by
way of deliberate genetic manipulation, such as by microinjection or by
infection with a recombinant virus. The term genetic manipulation does
not include classical cross-breeding, or in vitro fertilization, but rather is
directed to the introduction of a recombinant DNA molecule. This
molecule may be integrated within a chromosome, or it may be
extrachromosomally replicating DNA. In the typical transgenic animals
described herein, the transgene causes cells to express a recombinant
form of a protein. However, transgenic animals in which the recombinant
gene is silent are also contemplated, as for example, using the FLP or CRE
recombinase dependent constructs. Moreover, "transgenic animal" also
includes those recombinant animals in which gene disruption of one or
more genes is caused by human intervention, including both
recombination and antisense techniques.

As used herein, "transgene" describes genetic material that has
been or is about to be artificially inserted into the genome of a mammalian
cell, particularly a mammalian cell of a living animal. The transgene is
used to transform a cell, meaning that a permanent or transient genetic
change, typically a permanent genetic change, is induced in a cell
following incorporation of exogenous DNA. A permanent genetic change
is generally achieved by introduction of the DNA into the genome of the
cell. Vectors for stable integration include, but are not limited to,
plasmids, retroviruses and other animal viruses and YACs. Of interest are
transgenic mammals, including, but not limited to, cows, pigs, goats,
horses and others, and particularly rodents, including rats and mice.

As used herein, "associated" refers to coincidence with the
development or manifestation of a disease, condition or phenotype.
Association may be due to, but is not limited to, genes responsible for
housekeeping functions, those that are part of a pathway that is involved
in a specific disease, condition or phenotype and those that indirectly
contribute to the manifestation of a disease, condition or phenotype.

As used herein, "high serum cholesterol" refers to a level of serum
cholesterol that is greater than that considered to be in the normal range
for a given age in a population, e.g., about 5.25 mmoles/L or greater, i.e.,
approximately one standard deviation or more away from the age-adjusted
mean.

As used herein, "low serum HDL" refers to a level of serum HDL
that is less than that considered to be in the normal range for a given age
in a population, e.g., about 1.11 mmoles/L or less, i.e., approximately one
standard deviation or more away from the age-adjusted mean.
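
The two exemplary cut-off values given above can be applied to a
subject's measurements as in the following sketch. It is illustrative
only: the names are hypothetical, and the fixed thresholds are a
simplifying assumption, since the definitions above use "about" and
refer to the age-adjusted mean of a population rather than to a single
universal cut-off.

```python
# Exemplary cut-offs from the definitions above, in mmoles/L.
HIGH_CHOLESTEROL_MMOL_L = 5.25
LOW_HDL_MMOL_L = 1.11

def risk_factor_flags(cholesterol_mmol_l, hdl_mmol_l):
    """Flag high serum cholesterol and low serum HDL.

    Assumes both measurements are in mmoles/L and ignores age
    adjustment, which a real assessment would need to apply.
    """
    return {
        "high_serum_cholesterol": cholesterol_mmol_l >= HIGH_CHOLESTEROL_MMOL_L,
        "low_serum_hdl": hdl_mmol_l <= LOW_HDL_MMOL_L,
    }
```

For example, hypothetical measurements of 5.5 mmoles/L cholesterol and
1.0 mmoles/L HDL would be flagged for both risk factors, whereas 4.8 and
1.4 mmoles/L would be flagged for neither.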

As used herein, "cardiovascular disease" refers to any
manifestation of or predisposition to cardiovascular disease including, but
not limited to, coronary artery disease and myocardial infarction. Included
in predisposition is the manifestation of risk factors such as high serum
cholesterol levels and low serum HDL levels.

As used herein, "target nucleic acid" refers to a nucleic acid
molecule that contains all or a portion of a polymorphic region of a gene
of interest.

As used herein, "signal moiety" refers to any moiety that allows for
the detection of a nucleic acid molecule. Included are moieties covalently
attached to nucleic acids and those that are not.

As used herein, "biologically active agent that modulates serum
cholesterol" refers to any drug, including, but not limited to, a small
molecule, nucleic acid (sense and antisense), protein, peptide, lipid,
carbohydrate and combinations thereof, that exhibits some effect directly
or indirectly on the cholesterol measured in a subject's serum.

As used herein, "biologically active agent that modulates serum
HDL" refers to any drug, such as, but not limited to, a small molecule,
nucleic acid (sense and antisense), protein, peptide, lipid, carbohydrate
and combinations thereof, that exhibits some effect directly or indirectly
on the HDL measured in a subject's serum.

As used herein, "expression and/or activity" refers to the level of
transcription or translation of the COX6B or GPI-1 gene, mRNA stability,
protein stability or biological activity.

As used herein, "cardiovascular drug" refers to a drug used to treat
cardiovascular disease or a risk factor for the disease, either
prophylactically or after a risk factor or disease condition has developed.
Cardiovascular drugs include those drugs used to lower serum cholesterol
and those used to alter the level of serum HDL.

As used herein, "combining" refers to contacting the biologically
active agent with a cell or animal such that the agent is introduced into
the cell or animal. For a cell, any method that results in an agent
traversing the plasma membrane is useful. For an animal, any of the
standard routes of administration of an agent, e.g., oral, rectal,
transmucosal, intestinal, intravenous, intraperitoneal, intraventricular,
subcutaneous, intramuscular and other routes can be used.

As used herein, "positive response" refers to improving or 
ameliorating at least one symptom or detectable characteristic of a 
disease or condition, e.g., lowering serum cholesterol levels or raising 
serum HDL levels.

As used herein, "biological sample" refers to any cell type or tissue
of a subject from which nucleic acid, particularly DNA, can be obtained.

As used herein, "array" refers to a collection of three or more
items, such as a collection of immobilized nucleic acid probes arranged on a
solid substrate, such as silica, polymeric materials, glass and other
suitable support materials known to those of skill in the art.

As used herein, a composition refers to any mixture. It may be a
solution, a suspension, liquid, powder, a paste, aqueous, non-aqueous or
any combination thereof.

As used herein, a combination refers to any association between
two or among more items.

As used herein, "kit" refers to a package that contains a
combination, such as one or more primers or probes used to amplify or
detect polymorphic regions of genes associated with cardiovascular
disease, optionally including instructions and/or reagents for their use.

As used herein, "specifically hybridizes" refers to hybridization of a
probe or primer to a target sequence preferentially over a non-target
sequence. Those of skill in the art are familiar with parameters that affect
hybridization, such as temperature, probe or primer length and
composition, buffer composition and salt concentration, and can readily
adjust these parameters to achieve specific hybridization of a nucleic acid
to a target sequence.

As used herein, "nucleic acid" refers to polynucleotides such as
deoxyribonucleic acid (DNA) and ribonucleic acid (RNA). The term should
also be understood to include, as equivalents, derivatives, variants and
analogs of either RNA or DNA made from nucleotide analogs, and single-stranded
(sense or antisense) and double-stranded polynucleotides.
Deoxyribonucleotides include deoxyadenosine, deoxycytidine,
deoxyguanosine and deoxythymidine. For RNA, the nucleoside carrying the
uracil base is uridine.

As used herein, "mass spectrometry" encompasses any suitable
mass spectrometric format known to those of skill in the art. Such
formats include, but are not limited to, Matrix-Assisted Laser
Desorption/Ionization, Time-of-Flight (MALDI-TOF), Electrospray (ES),
IR-MALDI (see, e.g., published International PCT Application No. 99/57318
and U.S. Patent No. 5,118,937), Ion Cyclotron Resonance (ICR), Fourier
Transform and combinations thereof. MALDI, particularly UV and IR, are
among exemplary formats.

As used herein, the GPI-1 gene is generically used to include the
human GPI-1 gene and its homologs from rat, mouse, guinea pig
and other mammalian species. As described below, the GPI-1 gene refers
to a component of the GlcNAc transferase activity complex that functions
in the biosynthesis of glycosylphosphatidylinositol (GPI) anchor. Four
mammalian gene products (PIG-A, PIG-H, PIG-C and GPI-1) form a protein
complex that is responsible for the transferase enzyme activity in the
biosynthesis reaction. PIG-A, PIG-H and PIG-C are required for the first step
in GPI anchor biosynthesis; GPI-1 is not. Stabilization of the enzyme
complex, rather than participation in GlcNAc transfer, has been suggested
as a possible role for GPI-1 (Watanabe et al. EMBO 17:877, 1998).

B. Cytochrome c oxidase VIb gene

Cytochrome c oxidase (COX) is a mitochondrial enzyme complex
integrated in the inner membrane. It transfers electrons from cytochrome c
to molecular oxygen in the terminal reaction of the respiratory chain in
eukaryotic cells. COX consists of three large subunits encoded by the
mitochondrial genome and 10 other subunits encoded by nuclear genes.
The three subunits encoded by the mitochondrial genome are responsible for
the catalytic activity. The cytochrome c oxidase subunit VIb (COX6B) is
one of the nuclear gene products. The function of the nuclear encoded
subunits is unknown. One proposed role is in the regulation of catalytic
activity, specifically the rate of electron transport and stoichiometry of
proton pumping. Other proposed roles are not directly related to electron
transport and include energy-dependent calcium uptake and protein import
by the mitochondrion. Proteolytic removal of subunits VIa and VIb has
been associated with loss of calcium transport in reconstituted vesicles.
Steady-state levels of the COX6B transcript differ among
tissues (Taanman et al., Gene (1990), 93:285). As used herein, the COX6B
gene includes the human COX6B gene and its homologs from rat, mouse,
guinea pig, and any species that has a homologous gene.

Several single nucleotide polymorphisms have been identified in the
human COX6B gene. One of these is located at position 86 and is a C to
T transition that is manifested as a silent mutation in the coding region,
ACC to ACT (threonine to threonine) (SEQ ID NO.: 2). Although this is a
silent mutation at the amino acid level, it may represent an alteration that
changes codon usage, or it may affect mRNA stability, or it may be in
linkage disequilibrium with a non-silent change. Other known single
nucleotide polymorphisms of the COX6B gene include, but are not limited
to, those listed in Table 1.

TABLE 1

Gene                    GenBank Accession No.   SNP       SNP Location
COX6B                   NM_001863               C/T       86
(SEQ ID NO.: 1)                                 A/G       60
                                                A/T       324
                                                A/T       123

Based on methods disclosed herein and those used in the art, one
of skill would be able to use all the SNPs described and find additional
polymorphic regions of the COX6B gene to determine whether allelic
variants of these regions are associated with high cholesterol levels and
cardiovascular disease.

C. GPI-1 Gene

Glycosylphosphatidylinositol (GPI) functions to anchor various
eukaryotic proteins to membranes and is essential for their surface
expression. Thus, a defect in GPI anchor synthesis affects various
functions of cells, tissues and organs. Biosynthesis of
glycosylphosphatidylinositol (GPI) is initiated by the transfer of
N-acetylglucosamine (GlcNAc) from UDP-GlcNAc to phosphatidylinositol (PI)
and is catalyzed by a GlcNAc transferase, GPI-GlcNAc transferase
(GPI-GnT). Four mammalian gene products form a protein complex that is
responsible for this enzyme activity (PIG-A, PIG-H, PIG-C and GPI-1).
PIG-A, PIG-H and PIG-C are required for the first step in GPI anchor
biosynthesis; GPI-1 is not. Stabilization of the enzyme complex, rather
than participation in GlcNAc transfer, has been suggested as a possible
role for GPI-1 (Watanabe et al. EMBO 17:877, 1998).

A polymorphism has been identified at position 2577 of the human
GPI-1 gene. This is a G to A transition. This SNP is located in the 3'
untranslated region of the mRNA and does not affect protein structure,
but may affect mRNA stability or may be in linkage disequilibrium with a
non-silent change. Other known single nucleotide polymorphisms of the
GPI-1 gene include, but are not limited to, those listed in Table 2.

TABLE 2

Gene                    GenBank Accession No.   SNP       SNP Location
GPI-1                   NM_004204               C/T       2829
(SEQ ID NOS.: 6, 7)                             A/G       2577
                                                C/T       2519
                                                C/T       2289
                                                C/T       1938
                                                C/G       1563
                                                A/G/C/T   2664
                                                A/G       2656
                                                A/C/T     2167
                                                G/C/A     2166

Based on methods disclosed herein and those used in the art, one
of skill would be able to use all the described SNPs and find additional
polymorphic regions of the GPI-1 gene to determine whether allelic
variants of these regions are associated with low levels of HDL and
cardiovascular disease.

D. Other genes and polymorphisms associated with cardiovascular
disease

Many other genes and polymorphisms contained within them have
been associated with risk factors for cardiovascular disease (aberrations
in lipid metabolism; specifically high levels of serum cholesterol and low
levels of HDL and other such indicators) and/or the clinical phenotypes of
atherosclerosis and cardiovascular disease. Table 3 presents a list of
some of these genes and some associated polymorphisms (SNPs):
cholesterol ester transfer protein, plasma (CETP); apolipoprotein A-IV
(APO A4); apolipoprotein A-I (APO A1); apolipoprotein E (APO E);
apolipoprotein B (APO B); apolipoprotein C-III (APO C3); a gene encoding
lipoprotein lipase (LPL); ATP-binding cassette transporter (ABC 1);
paraoxonase 1 (PON 1); paraoxonase 2 (PON 2);
5,10-methylenetetrahydrofolate reductase (MTHFR); a gene encoding hepatic
lipase (LIPC); E-selectin; G protein beta 3 subunit and angiotensin II type
1 receptor gene. The SNP locations are based on the GenBank sequence.
Table 3 is not meant to be exhaustive, as one of skill in the art based on
the disclosure would be able to readily use other known polymorphisms in
these and other genes, new polymorphisms discovered in previously
identified genes and newly identified genes and polymorphisms in the
methods and compositions disclosed herein.

TABLE 3

Gene                    GenBank Accession No.   SNP       SNP Location
CETP                    NM_000078               C/A       991
(SEQ ID NOS.: 11, 12)                           C/T       196
                                                A/G       1586
                                                A/G       1394
                                                A/G       1439
                                                C/G       1297
                                                C/T       766
                                                G/A       1131
                                                G/A       1696
LPL                     NM_000237               A/G       1127
(SEQ ID NOS.: 13, 14)                           A/C       3447
                                                C/T       1973
                                                C/T       3343
                                                G/A       2851
                                                C/T       3272
                                                A/T       2428
                                                T/C       2743
                                                G/A       1453
                                                C/A       3449
                                                G/A       1282
                                                G/A       579
                                                A/C       1338
                                                A/G/T/C   2416-2426
                                                A/G       2427
                                                C/T       1302
                                                G/A       609
                                                G/C       1595
                                                G/A       1309
                                                C/T       2454
                                                C/T       2988
                                                G/A       280
                                                G/A       1036
APO A4                  NM_000482               G/T       1122
(SEQ ID NOS.: 15, 16)                           G/C       1033
                                                G/A       1002
                                                C/T       960
                                                C/T       894
                                                G/A       554
                                                G/A       950
                                                T/C       336
                                                G/A       334
                                                C/T       330
                                                A/G       201
                                                A/G       16
                                                A/T       1213
APO E                   NM_000041               C/T       448
(SEQ ID NOS.: 17, 18)                           G/A       448
(mRNA)                                          C/T       586
                                                C/T       197
                                                C/T       540
Hepatic Lipase          NM_000236               C/G       680
(SEQ ID NOS.: 19, 20)                           G/A       1374
                                                G/A       701
                                                C/A       1492
                                                A/G       648
                                                G/C       729
                                                G/A       340
                                                G/T       522
PON 1                   NM_000446               A/T       172
(SEQ ID NOS.: 21, 22)                           A/G       584
                                                G/C       190
PON 2                   XM_004947               C/G       475
(SEQ ID NOS.: 23, 24)                           C/G       964
APO C3                  NM_000040               C/T       148
(SEQ ID NOS.: 25, 26)                           T/A       471
                                                G/C       386
                                                G/T       417
                                                T/A       495
ABC 1                   XM_005567               G/A       8591
(SEQ ID NOS.: 27, 28)
APO A1                  NM_000039               C/G       770
(SEQ ID NOS.: 29, 30)                           G/A       656
                                                C/G       589
                                                C/G       414
                                                A/T       430
                                                C/T       708
                                                C/T       221
                                                T/G       223
                                                C/T       597
                                                A/G       340
                                                G/C       690
APO B                   NM_000384               A/G/C/T   13141
(SEQ ID NOS.: 31, 32)                           A/G/C/T   12669
                                                C/T       11323
                                                G/C       10422
                                                A/C       10408
                                                C/G       10083
                                                C/T       7064
                                                C/T       6666
                                                C/T       1980
                                                C/G       5751
                                                C/T       7673
                                                C/A/G/T   8344
                                                G/C/T/A   4393
                                                A/C/T/G   5894
                                                A/T       12019
                                                C/T       11973
                                                G/C/T/A   7065
                                                C/G       947
                                                C/G       7331
                                                A/G       7221
                                                G/C       6402
                                                G/C       3780
                                                C/G       1661
                                                A/T       8167
                                                C/A       8126
                                                C/T       421
                                                C/T       1981
                                                G/A       12510
                                                G/C       12937
                                                G/A       11042
                                                C/T       2834
                                                A/G       5869
                                                A/G       11962
                                                C/G       4439
                                                G/A       7824
                                                G/A       13569
                                                G/A       9489
                                                G/A       2325
                                                G/A       10259
                                                C/G       14
MTHFR                   NM_005957               G/A       5442
(SEQ ID NOS.: 33, 34)                           A/G       5113
                                                A/G       5113
                                                A/G       5110
                                                A/G       5102
                                                A/C/T     5097
                                                A/C/T     5097
                                                C/T       5079
                                                C/T       5079
                                                T/C       5071
                                                T/C       5071
                                                T/C       5051
                                                G/A       5012
                                                C/A       5000
                                                A/G       4998
                                                A/G       4994
                                                A/G       4994
                                                A/G       4994
                                                C/T       4991
                                                C/T       4991
                                                C/T       4991
                                                A/G       4986
                                                A/G       4986
                                                A/G       4986
                                                C/T       4985
                                                T/A       4982
                                                T/G       4981
                                                T/C       4981
                                                T/C       4981
                                                G/C/A     4967
                                                G/A       4963
                                                A/G       4962
                                                G/C/T     4962
                                                A/C/G/T   4961
                                                A/C/T     4961
                                                A/C       4961
                                                A/C       4961
                                                A/C/T     4960
                                                T/C       4938
                                                T/C       4937
                                                T/C       4933
                                                G/C/T     4933
                                                C/T       4929
                                                C/T       4929
                                                T/A/G     4929
                                                A/G       4928
                                                G/C       4928
                                                C/G       4927
                                                G/A       4923
                                                C/T       4919
                                                A/T/G     4913
                                                C/T       4912
                                                A/T       4903
                                                C/T       4902
                                                A/G       4900
                                                G/A       4898
                                                G/T       4898
                                                C/T       4897
                                                G/T       4894
                                                T/C/G     4836
                                                C/T       3862
                                                C/T       4922
                                                C/T       4959
                                                T/C       4981
                                                A/G       4994
                                                A/G       5044
                                                T/C       5051
                                                G/C       5066
                                                C/T       5079
                                                C/A/G     5085
                                                C/T       5092
                                                A/G       5103
                                                A/G       5113
                                                C/T       1021
E-Selectin              NM_000450               G/A       3484
(SEQ ID NOS.: 35, 36)                           G/A       3093
                                                T/G       2939
                                                T/C       2902
                                                C/T       1937
                                                C/T       1916
                                                C/T       1839
                                                C/T       1805
                                                C/T       1518
                                                G/C       1377
                                                C/T       1376
                                                G/A       999
                                                T/C       857
                                                A/C       561
                                                C/G       506
                                                A/G       392
                                                G/T       98
G protein β3 subunit    NM_002075               C/T       1828
(SEQ ID NOS.: 37, 38)                           C/T       1546
                                                G/T       1431
                                                G/A       1231
                                                C/T       1230
Angiotensin II type 1   NM_00686                G/A       1453
receptor gene                                   C/G       968
(SEQ ID NOS.: 39, 40)                           G/C       966
                                                T/C       941
                                                G/A       894
                                                T/C       659

Assays to identify the nucleotide present at the polymorphic site
include those described herein and all others known to those who practice
the art.

For some of the SNPs described above, a description is provided of the
MassEXTEND™ reaction components that can be used
to determine the allelic variant that is present. Included are the forward
and reverse primers used for amplification. Also included are the
MassEXTEND™ primer used in the primer extension reaction and the
extended MassEXTEND™ primers for each allele. MassEXTEND™
reactions are carried out and the products analyzed as described in
Examples 2 and 3.

CETP

Position 991 (C/A)
PCR primers:
Forward: ACTGCCTGATAACCATGCTG (SEQ ID NO.: 41)
Reverse: ATACTTACACACCAGGAGGG (SEQ ID NO.: 42)
MassEXTEND™ Primer: ATGCCTGCTCCAAAGGCAC (SEQ ID NO.: 43)
Primer Mass: 5757.8
Extended Primer-Allele C: ATGCCTGCTCCAAAGGCACC (SEQ ID NO.: 44)
Extended Primer Mass: 6030.9
Extended Primer-Allele A: ATGCCTGCTCCAAAGGCACAT (SEQ ID NO.: 45)
Extended Primer Mass: 6359.2

Position 196 (C/T)
PCR primers:
Forward: TACTTCTGGTTCTCTGAGCG (SEQ ID NO.: 46)
Reverse: ACTCACCTTGAACTCGTCTC (SEQ ID NO.: 47)
MassEXTEND™ Primer: TGGTTCTCTGAGCGAGTCTT (SEQ ID NO.: 48)
Primer Mass: 6130
Extended Primer-Allele C: TGGTTCTCTGAGCGAGTCTTC (SEQ ID NO.: 49)
Extended Primer Mass: 6707.4
Extended Primer-Allele T: TGGTTCTCTGAGCGAGTCTTTG (SEQ ID NO.: 50)
Extended Primer Mass: 6333.1

Position 1586 (A/G)
PCR primers:
Forward: TGGAGATGGAGTTTGGGTTG (SEQ ID NO.: 51)
Reverse: TGGTTGGGTTGTGGTAGAAG (SEQ ID NO.: 52)
MassEXTEND™ Primer: GTTGGGTGAGCAGCTGGTG (SEQ ID NO.: 53)
Primer Mass: 5715.7
Extended Primer-Allele G: CTTCGGTGAGGAGCTGCTGGT (SEQ ID NO.: 54)
Extended Primer Mass: 6333.1
Extended Primer-Allele A: GTTGGGTGAGGAGGTGGTGA (SEQ ID NO.: 55)
Extended Primer Mass: 6012.9

APOA4

Position 1122 (G/T)
PCR primers:
Forward: AACAGCTGAGGAGGAAAGTG (SEQ ID NO.: 56)
Reverse: AGAAGGAGTTGACCTTGTCC (SEQ ID NO.: 57)
MassEXTEND™ Primer: GGAAGCTCAAGTGGCCTTC (SEQ ID NO.: 58)
Primer Mass: 5828.8
Extended Primer-Allele G: GGAAGCTCAAGTGGCCTTCC (SEQ ID NO.: 59)
Extended Primer Mass: 6102.0
Extended Primer-Allele T: GGAAGCTCAAGTGGCCTTCAAC (SEQ ID NO.: 60)
Extended Primer Mass: 6728.4

Position 1033 (G/C)
PCR primers:
Forward: AAGTCACTGGCAGAGCTGG (SEQ ID NO.: 61)
Reverse: GCACCAGGGCTTTGTTGAAG (SEQ ID NO.: 62)
MassEXTEND™ Primer: TTTTCCCCGTAGGGCTCCA (SEQ ID NO.: 63)
Primer Mass: 5730.7
Extended Primer-Allele G: TTTTCCCCGTAGGGCTCCAC (SEQ ID NO.: 64)
Extended Primer Mass: 6003.9
Extended Primer-Allele C: TTTTCCCCGTAGGGCTCCAGC (SEQ ID NO.: 65)
Extended Primer Mass: 6333.1

Position 1002 (G/A)
PCR primers:
Forward: TGCAGAAGTCACTGGCAGAG (SEQ ID NO.: 66)
Reverse: GTTGAAGTTTTCCCCGTAGG (SEQ ID NO.: 67)
MassEXTEND™ Primer: ACTCCTCCACCTGCTGGTC (SEQ ID NO.: 68)
Primer Mass: 5675.7
Extended Primer-Allele G: ACTCCTCCACCTGCTGGTCC (SEQ ID NO.: 69)
Extended Primer Mass: 5948.9
Extended Primer-Allele A: ACTCCTCCACCTGCTGGTCTA (SEQ ID NO.: 70)
Extended Primer Mass: 6277.1

Position 960 (C/T)
PCR primers:
Forward: AGGACGTGCGTGGCAACCTG (SEQ ID NO.: 71)
Reverse: AGCTCTGCCAGTGACTTCTG (SEQ ID NO.: 72)
MassEXTEND™ Primer: GTGACTTCTGCAGCCCCTC (SEQ ID NO.: 73)
Primer Mass: 5715.7
Extended Primer-Allele T: GTGACTTCTGCAGCCCCTCA (SEQ ID NO.: 74)
Extended Primer Mass: 6012.9
Extended Primer-Allele C: GTGACTTCTGCAGCCCCTCGGT (SEQ ID NO.: 75)
Extended Primer Mass: 6662.3

Position 894 (C/T)
PCR primers:
Forward: GGTGACGTTCGAGATGAAG (SEQ ID NO.: 76)
Reverse: TGAGGTTGGGAGGGAGGTC (SEQ ID NO.: 77)
MassEXTEND™ Primer: CAGGATGTCGGCCAGTGG (SEQ ID NO.: 78)
Primer Mass: 5500.6
Extended Primer-Allele C: GAGGATGTCGGCCAGTGGG (SEQ ID NO.: 79)
Extended Primer Mass: 5773.8
Extended Primer-Allele T: GAGGATGTCGGCCAGTGCTG (SEQ ID NO.: 80)
Extended Primer Mass: 6118.0

Position 554 (G/A)
PCR primers:
Forward: AGGTGGGAGAGCTTCAGCAG (SEQ ID NO.: 81)
Reverse: TGTGGATGGGGTGTGCGTAG (SEQ ID NO.: 82)
MassEXTEND™ Primer: AGCTGCGCACCCAGGTCA (SEQ ID NO.: 83)
Primer Mass: 5469.6
Extended Primer-Allele A: AGCTGCGCACCCAGGTCAA (SEQ ID NO.: 84)
Extended Primer Mass: 5766.8
Extended Primer-Allele G: AGCTGCGCACCCAGGTCAGC (SEQ ID NO.: 85)
Extended Primer Mass: 6072.0

APOE

Position 448 (C/T)
PCR primers:
Forward: TGTCCAAGGAGCTGCAGGC (SEQ ID NO.: 86)
Reverse: CTTACGCAGCTTGCGCAGGT (SEQ ID NO.: 87)
MassEXTEND™ Primer: GCGGACATGGAGGACGTG (SEQ ID NO.: 88)
Primer Mass: 5629.7
Extended Primer-Allele C: GCGGACATGGAGGACGTGC (SEQ ID NO.: 89)
Extended Primer Mass: 5902.8
Extended Primer-Allele T: GCGGACATGGAGGACGTGTG (SEQ ID NO.: 90)
Extended Primer Mass: 6247.1

LPL

Position 1127 (A/G)
PCR primers:
Forward: GTTGTAGAAAGAACGGGTGC (SEQ ID NO.: 91)
Reverse: GAGAAGGAGTGTTCAGGTAG (SEQ ID NO.: 92)
MassEXTEND™ Primer: ACAATCTGGGCTATGAGATCA (SEQ ID NO.: 93)
Primer Mass: 6454.2
Extended Primer-Allele A: ACAATCTGGGCTATGAGATCAA (SEQ ID NO.: 94)
Extended Primer Mass: 6751.4
Extended Primer-Allele G: ACAATCTGGGCTATGAGATCAGT (SEQ ID NO.: 95)
Extended Primer Mass: 7071.6

Position 3447 (A/C)
PCR primers:
Forward: GAGTGTAGAGTGGATGTGTG (SEQ ID NO.: 96)
Reverse: ACCGTTGTGAAAAGGAGAGG (SEQ ID NO.: 97)
MassEXTEND™ Primer: GAGGAGAGACAAGGCAGATA (SEQ ID NO.: 98)
Primer Mass: 6273.1
Extended Primer-Allele A: GAGGAGAGACAAGGCAGATAT (SEQ ID NO.: 99)
Extended Primer Mass: 6561.3
Extended Primer-Allele C: GAGGAGAGACAAGGCAGATAGT (SEQ ID NO.: 100)
Extended Primer Mass: 6890.5

Position 1973 (C/T)
PCR primers:
Forward: AAAGGTTCAGTTGCTGCTGC (SEQ ID NO.: 101)
Reverse: GCTGGGGAAGGTCTAATAAC (SEQ ID NO.: 102)
MassEXTEND™ Primer: GTTGCTGCTGCCTCGAATC (SEQ ID NO.: 103)
Primer Mass: 5770.7
Extended Primer-Allele C: GTTGCTGCTGCCTCGAATCC (SEQ ID NO.: 104)
Extended Primer Mass: 6043.9
Extended Primer-Allele T: GTTGCTGCTGCCTCGAATCTG (SEQ ID NO.: 105)
Extended Primer Mass: 6388.2

LIPC

Position 680 (C/G)
PCR primers:
Forward: CGTCTTTCTCCAGATGATGC (SEQ ID NO.: 106)
Reverse: AGTGTCCTATGGGCTGTTTG (SEQ ID NO.: 107)
MassEXTEND™ Primer: GGATGCCATTCATACCTTTAC (SEQ ID NO.: 108)
Primer Mass: 6356.1
Extended Primer-Allele C: GGATGCCATTCATACCTTTACC (SEQ ID NO.: 109)
Extended Primer Mass: 6629.3
Extended Primer-Allele G: GGATGCCATTCATACCTTTACGC (SEQ ID NO.: 110)
Extended Primer Mass: 6958.5

Position 1374 (G/A)
PCR primers:
Forward: TGGGAAAACAGTGCAGTGTG (SEQ ID NO.: 111)
Reverse: TGATCGTCTTCAGAACGAGG (SEQ ID NO.: 112)
MassEXTEND™ Primer: CCAGACCATCATCCCATGGA (SEQ ID NO.: 113)
Primer Mass: 6030.9
Extended Primer-Allele A: CCAGACCATCATCCCATGGAA (SEQ ID NO.: 114)
Extended Primer Mass: 6328.1
Extended Primer-Allele G: CCAGACCATCATCCCATGGAGC (SEQ ID NO.: 115)
Extended Primer Mass: 6633.3

Position 701 (G/A)
PCR primers:
Forward: CAGCAATCGTCTTTCTCCAG (SEQ ID NO.: 116)
Reverse: TCCTATGGGCTGTTTGATGC (SEQ ID NO.: 117)
MassEXTEND™ Primer: GTCTTTCTCCAGATGATGCCA (SEQ ID NO.: 118)
Primer Mass: 6372.2
Extended Primer-Allele A: GTCTTTCTCCAGATGATGCCAA (SEQ ID NO.: 119)
Extended Primer Mass: 6669.4
Extended Primer-Allele G: GTCTTTCTCCAGATGATGCCAGT (SEQ ID NO.: 120)
Extended Primer Mass: 6989.6


E. Databases

Databases for determining an association between polymorphic
regions of genes and intermediate and clinical phenotypes contain
biological samples (e.g., blood) that provide a source of nucleic acid and
clinical data covering diseases (e.g., age, sex, ethnicity, medical history
and family medical history) from both individuals exhibiting the phenotype
(intermediate phenotype (risk factor) or clinical phenotype (disease)) and
those who do not. These databases include human population groups
such as twins, diverse affected families, isolated founder populations and
drug trial subjects. The quality and consistency of the clinical resources
are of primary importance.

F. Association Studies

The examples set forth below used an extreme trait analysis to
discover an association between an allelic variant of the COX6B gene and
high cholesterol and an association between an allelic variant of the GPI-1
gene and low HDL. This analysis is based on comparing a pair of pools of
DNA from individuals who exhibit, respectively, hyponormal or hypernormal
levels of a biochemical trait (e.g., cholesterol or HDL) and individually
examining SNPs for a difference in allelic frequency between the pools. An
association is considered to be positive if a 1-degree-of-freedom
chi-squared test of association yields a statistic of at least 3.841, the
critical value for p = 0.05. Standard multiple testing corrections
are applied if more than one SNP is considered at a time, i.e., multiple
SNPs are tested during the same study. Although not always required, it
may be necessary to further examine the frequency of allelic variants in
other populations, including those exhibiting normal levels of the given
trait.
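
The pool comparison described above can be illustrated with a minimal sketch. It assumes the pooled measurements have already been converted to allele counts per pool; the function names and the example counts are hypothetical and are not taken from the examples of this application. The statistic is the ordinary Pearson chi-squared for a 2 x 2 table, without continuity correction and without any multiple testing correction.

```python
# Illustrative sketch only: allele counts below are hypothetical.
CRITICAL_VALUE_1DF_P05 = 3.841  # chi-squared critical value, 1 d.f., p = 0.05


def chi_squared_2x2(a, b, c, d):
    """Pearson chi-squared statistic for the 2 x 2 table [[a, b], [c, d]].

    Rows are the two pools (e.g., low-trait and high-trait); columns are
    the counts of allele 1 and allele 2 observed in each pool.
    """
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    if 0 in (row1, row2, col1, col2):
        raise ValueError("every row and column needs at least one observation")
    return n * (a * d - b * c) ** 2 / (row1 * row2 * col1 * col2)


def is_associated(statistic, critical_value=CRITICAL_VALUE_1DF_P05):
    """Apply the single-SNP threshold stated in the text."""
    return statistic >= critical_value


# Hypothetical pools: 60/40 allele counts in one pool, 40/60 in the other.
example_statistic = chi_squared_2x2(60, 40, 40, 60)
example_positive = is_associated(example_statistic)
```

When several SNPs are tested in one study, the threshold would have to be tightened by whichever multiple testing correction is chosen; that step is deliberately left out of this sketch.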

For a qualitative trait (e.g., hypertension) association studies are 
based on determining the occurrence of certain alleles in a given 
population of diseased vs. healthy individuals. 

Allelic variants of COX6B, GPI-1 and other genes found to
associate with high cholesterol, low HDL and/or cardiovascular disease
can represent useful markers for indicating a predisposition for developing
a risk factor for cardiovascular disease. These allelic variants may not
necessarily represent functional variants affecting the expression,
stability, or activity of the encoded protein product. Those of skill in the
art would be able to determine which allelic variants are to be used, alone
or in conjunction with other variants, only for indicating a predisposition
for cardiovascular disease or for profiling of drug reactivity, and to
determine those that may also be useful for screening for potential
therapeutics.

Any method used to determine association can be used to discover
or confirm the association of other polymorphic regions in the COX6B
gene, the GPI-1 gene or any other gene that may be associated with
cardiovascular disease.

G. Detection of Polymorphisms

1. Nucleic acid detection methods

Generally, these methods are based on sequence-specific
polynucleotides, oligonucleotides, probes and primers. Any method
known to those of skill in the art for detecting a specific nucleotide within
a nucleic acid sequence or for determining the identity of a specific
nucleotide in a nucleic acid sequence is applicable to the methods of
determining the presence or absence of an allelic variant of a COX6B
gene or GPI-1 gene or another gene associated with cardiovascular
disease. Such methods include, but are not limited to, techniques
utilizing nucleic acid hybridization of sequence-specific probes, nucleic
acid sequencing, selective amplification, analysis of restriction enzyme
digests of the nucleic acid, cleavage of mismatched heteroduplexes of
nucleic acid and probe, alterations of electrophoretic mobility, primer
specific extension, oligonucleotide ligation assay and single-stranded
conformation polymorphism analysis. In particular, primer extension
reactions that specifically terminate by incorporating a dideoxynucleotide
are useful for detection. Several such general nucleic acid detection
assays are described in U.S. Patent No. 6,030,778.

a. Primer extension-based methods

Several primer extension-based methods for determining the
identity of a particular nucleotide in a nucleic acid sequence have been
reported (see, e.g., PCT Application No. PCT/US96/03651
(WO 96/29431), PCT Application No. PCT/US97/20444 (WO 98/20019),
PCT Application No. PCT/US91/00046 (WO 91/13075), and U.S. Patent
No. 5,856,092). In general, a primer is prepared that specifically
hybridizes adjacent to a polymorphic site in a particular nucleic acid
sequence. The primer is then extended in the presence of one or more
dideoxynucleotides, typically with at least one of the dideoxynucleotides
being the complement of the nucleotide that is polymorphic at the site.
The primer and/or the dideoxynucleotides may be labeled to facilitate a
determination of primer extension and identity of the extended nucleotide.
In one method, primer extension and/or the identity of the extended
nucleotide(s) are determined by mass spectrometry (see, e.g., PCT
Application Nos. PCT/US96/03651 (WO 96/29431) and PCT/US97/20444
(WO 98/20019)).
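
As a rough illustration of why the extension products of different alleles can be told apart by mass, the sketch below estimates the average mass of an unmodified DNA primer and of its dideoxy-terminated extension product. The residue masses and the end-group correction are standard average values supplied here as assumptions, not figures from this application, and the function name is hypothetical. For the CETP position 991 MassEXTEND primer listed above (ATGCCTGCTCCAAAGGCAC), the estimate comes out close to the listed primer mass of 5757.8.

```python
# Illustrative sketch only. Average masses (Da) of deoxynucleotide
# monophosphate residues within a DNA chain; assumed standard values.
RESIDUE_MASS = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}

# Correction for an oligonucleotide with free 5'-OH and 3'-OH ends.
END_GROUP_CORRECTION = -61.96

# A dideoxynucleotide lacks the 3' hydroxyl oxygen of a deoxynucleotide.
DIDEOXY_CORRECTION = -16.00


def oligo_mass(sequence, dideoxy_terminated=False):
    """Estimate the average mass (Da) of a single-stranded DNA oligo.

    If dideoxy_terminated is True, the 3'-terminal residue is treated as
    a dideoxynucleotide, as in a terminated primer extension product.
    """
    mass = sum(RESIDUE_MASS[base] for base in sequence.upper())
    mass += END_GROUP_CORRECTION
    if dideoxy_terminated:
        mass += DIDEOXY_CORRECTION
    return mass


primer = "ATGCCTGCTCCAAAGGCAC"
primer_mass = oligo_mass(primer)
# Allele C product: primer terminated directly by ddC.
allele_c_mass = oligo_mass(primer + "C", dideoxy_terminated=True)
# Allele A product: primer extended by dA, then terminated by ddT.
allele_a_mass = oligo_mass(primer + "AT", dideoxy_terminated=True)
```

Under these assumptions the two allele products differ by several hundred daltons, which is the separation the mass spectrometric readout relies on.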

b. Polymorphism-specific probe hybridization

One exemplary detection method is allele specific hybridization
using probes overlapping the polymorphic site and having about 5, 10,
15, 20, 25, or 30 nucleotides around the polymorphic region. The probes
can contain naturally occurring or modified nucleotides (see U.S. Patent
No. 6,156,501). For example, oligonucleotide probes may be prepared in
which the known polymorphic nucleotide is placed centrally
(allele-specific probes) and then hybridized to target DNA under conditions that
permit hybridization only if a perfect match is found (Saiki et al. (1986)
Nature 324:163; Saiki et al. (1989) Proc. Natl. Acad. Sci. USA 86:6230;
and Wallace et al. (1979) Nucl. Acids Res. 6:3543). Such allele specific
oligonucleotide hybridization techniques may be used for the simultaneous
detection of several nucleotide changes in different polymorphic regions.
For example, oligonucleotides having nucleotide sequences of specific
allelic variants are attached to a hybridizing membrane and this membrane
is then hybridized with labeled sample nucleic acid. Analysis of the
hybridization signal will then reveal the identity of the nucleotides of the
sample nucleic acid. In one embodiment, several probes capable of
hybridizing specifically to allelic variants are attached to a solid phase
support, e.g., a "chip". Oligonucleotides can be bound to a solid support
by a variety of processes, including lithography. For example, a chip can
hold up to 250,000 oligonucleotides (GeneChip, Affymetrix, Santa Clara,
CA). Mutation detection analysis using these chips comprising
oligonucleotides, also termed "DNA probe arrays", is described, e.g., in
Cronin et al. (1996) Human Mutation 7:244 and in Kozal et al. (1996)
Nature Medicine 2:753. In one embodiment, a chip includes all the allelic
variants of at least one polymorphic region of a gene. The solid phase
support is then contacted with a test nucleic acid and hybridization to the
specific probes is detected. Accordingly, the identity of numerous allelic
variants of one or more genes can be identified in a simple hybridization
experiment.
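
The central placement of the polymorphic nucleotide described above can be sketched as follows. The reference sequence, SNP index and flank length in the usage lines are hypothetical placeholders chosen only to show the construction; the function name is likewise an assumption and does not correspond to any probe of this application.

```python
# Illustrative sketch only: sequence and coordinates are hypothetical.
def allele_specific_probes(reference, snp_index, alleles, flank=10):
    """Return one probe per allele with the polymorphic base placed centrally.

    reference : sequence containing the polymorphic site
    snp_index : 0-based index of the polymorphic nucleotide in reference
    alleles   : iterable of single-nucleotide alleles, e.g. "CT"
    flank     : number of nucleotides kept on each side of the site
    """
    if not 0 <= snp_index < len(reference):
        raise ValueError("snp_index lies outside the reference sequence")
    start = max(0, snp_index - flank)
    end = min(len(reference), snp_index + flank + 1)
    left = reference[start:snp_index]
    right = reference[snp_index + 1:end]
    return {allele: left + allele + right for allele in alleles}


# Hypothetical 21-nucleotide reference with a C/T site at index 10.
probes = allele_specific_probes("ACGTACGTACGTACGTACGTA", 10, "CT", flank=3)
```

Each returned probe differs from the others only at its central position, so a perfect-match hybridization condition distinguishes the alleles.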

c. Nucleic acid amplification-based methods

In other detection methods, it is necessary to first amplify at least a
portion of a COX6B gene, GPI-1 gene or another gene associated with
cardiovascular disease prior to identifying the allelic variant. Amplification
can be performed, e.g., by PCR and/or LCR, according to methods known
in the art. In one embodiment, genomic DNA of a cell is exposed to two
PCR primers and amplification is performed for a number of cycles
sufficient to produce the required amount of amplified DNA. In certain
embodiments, the primers are located between 150 and 350 base pairs
apart.


WO 02/072604                                              PCT/US02/06728

Alternative amplification methods include: self sustained sequence
replication (Guatelli, J. C. et al. (1990) Proc. Natl. Acad. Sci. U.S.A.
87:1874-1878); transcriptional amplification system (Kwoh, D. Y. et al.
(1989) Proc. Natl. Acad. Sci. U.S.A. 86:1173-1177); Q-Beta Replicase
(Lizardi, P. M. et al. (1988) Bio/Technology 6:1197) and any other nucleic
acid amplification method, followed by the detection of the amplified
molecules using techniques well known to those of skill in the art. These
detection schemes are also useful for the detection of nucleic acid
molecules if such molecules are present in very low numbers.

Alternatively, allele specific amplification technology, which
depends on selective PCR amplification, may be used in conjunction with
the alleles provided herein. Oligonucleotides used as primers for specific
amplification may carry the allelic variant of interest in the center of the
molecule (so that amplification depends on differential hybridization)
(Gibbs et al. (1989) Nucleic Acids Res. 17:2437-2448) or at the extreme
3' end of one primer where, under appropriate conditions, mismatch can
prevent or reduce polymerase extension (Prossner (1993) Tibtech
11:238; Newton et al. (1989) Nucl. Acids Res. 17:2503). In addition, it
may be desirable to introduce a restriction site in the region of the
mutation to create cleavage-based detection (Gasparini et al. (1992) Mol.
Cell Probes 6:1).

d. Nucleic acid sequencing-based methods 
In one embodiment, any of a variety of sequencing reactions
known in the art can be used to directly sequence at least a portion of the
COX6B gene, GPI-1 gene or other gene associated with cardiovascular
disease and to detect allelic variants, e.g., mutations, by comparing the
sequence of the sample with the corresponding wild-type
(control) sequence. Exemplary sequencing reactions include those based
on techniques developed by Maxam and Gilbert (Proc. Natl. Acad. Sci.
USA (1977) 74:560) or Sanger (Sanger et al. (1977) Proc. Natl. Acad. Sci.
74:5463). It is also contemplated that any of a variety of automated
sequencing procedures may be used when performing the subject assays
(Biotechniques (1995) 19:448), including sequencing by mass
spectrometry (see, for example, U.S. Patent No. 5,547,835 and
International PCT Application No. WO 94/16101, entitled "DNA
Sequencing by Mass Spectrometry" by H. Koster; U.S. Patent No.
5,547,835 and International PCT Application No. WO 94/21822, entitled
"DNA Sequencing by Mass Spectrometry Via Exonuclease Degradation"
by H. Koster; U.S. Pat. No. 5,605,798 and International Patent
Application No. PCT/US96/03651, entitled "DNA Diagnostics Based on
Mass Spectrometry" by H. Koster; Cohen et al. (1996) Adv Chromatogr
36:127-162; and Griffin et al. (1993) Appl Biochem Biotechnol
38:147-159). It will be evident to one skilled in the art that, for certain
embodiments, the occurrence of only one, two or three of the nucleic
acid bases need be determined in the sequencing reaction. For instance,
A-track sequencing or an equivalent, e.g., where only one nucleotide is
detected, can be carried out. Other sequencing methods are disclosed,
e.g., in U.S. Patent No. 5,580,732 entitled "Method of DNA sequencing
employing a mixed DNA-polymer chain probe" and U.S. Patent No.
5,571,676 entitled "Method for mismatch-directed in vitro DNA
sequencing".
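The comparison of a sample sequence with the corresponding wild-type (control) sequence can be illustrated with a minimal sketch. It assumes the two reads are already aligned and of equal length; the function name and the example sequences are hypothetical.

```python
def find_variants(reference, sample):
    """List (1-based position, reference base, sample base) for each mismatch.

    Assumes the two sequences are pre-aligned and of equal length.
    """
    if len(reference) != len(sample):
        raise ValueError("sequences must be aligned to the same length")
    return [
        (position, ref_base, sample_base)
        for position, (ref_base, sample_base)
        in enumerate(zip(reference, sample), start=1)
        if ref_base != sample_base
    ]


# Hypothetical wild-type and sample reads differing at one position.
print(find_variants("ACGTAC", "ACATAC"))
```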

e. Restriction enzyme digest analysis 

In some cases, the presence of a specific allele in nucleic acid,
particularly DNA, from a subject can be shown by restriction enzyme
analysis. For example, a specific nucleotide polymorphism can result in a
nucleotide sequence containing a restriction site that is absent from the
nucleotide sequence of another allelic variant.
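As a sketch of how such a polymorphism is read out, the following computes the fragment lengths expected after digestion of two allelic sequences. The sequences are hypothetical; the recognition site GAATTC with a cut after the first base corresponds to EcoRI and is used here only as an example enzyme.

```python
def fragment_lengths(sequence, site, cut_offset):
    """Return fragment lengths after cutting at every occurrence of `site`.

    cut_offset is the number of bases from the start of the site to the cut
    on the strand shown (1 for G^AATTC).
    """
    cuts = []
    start = sequence.find(site)
    while start != -1:
        cuts.append(start + cut_offset)
        start = sequence.find(site, start + 1)
    bounds = [0] + cuts + [len(sequence)]
    return [right - left for left, right in zip(bounds, bounds[1:])]


allele_with_site = "AAAAGAATTCTTTT"     # hypothetical allele containing GAATTC
allele_without_site = "AAAAGAGTTCTTTT"  # single-base variant, site destroyed
print(fragment_lengths(allele_with_site, "GAATTC", 1))
print(fragment_lengths(allele_without_site, "GAATTC", 1))
```

The allele carrying the site yields two fragments, whereas the variant remains uncut, so the two alleles are distinguishable by fragment size.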
f. Mismatch Cleavage

Protection from cleavage agents, such as, but not limited to, a
nuclease, hydroxylamine or osmium tetroxide and with piperidine, can be
used to detect mismatched bases in RNA/RNA, DNA/DNA, or RNA/DNA
heteroduplexes (Myers, et al. (1985) Science 230:1242). In general, the
technique of "mismatch cleavage" starts by providing heteroduplexes
formed by hybridizing a control nucleic acid, which is optionally labeled,
e.g., RNA or DNA, comprising a nucleotide sequence of an allelic variant
with a sample nucleic acid, e.g., RNA or DNA, obtained from a tissue
sample. The double-stranded duplexes are treated with an agent that
cleaves single-stranded regions of the duplex, such as those formed
by base pair mismatches between the control and sample strands.
For instance, RNA/DNA duplexes can be treated with RNase and
DNA/DNA hybrids treated with S1 nuclease to enzymatically digest the
mismatched regions.

In other embodiments, either DNA/DNA or RNA/DNA duplexes can
be treated with hydroxylamine or osmium tetroxide and with piperidine in
order to digest mismatched regions. After digestion of the mismatched
regions, the resulting material is then separated by size on denaturing
polyacrylamide gels to determine whether the control and sample nucleic
acids have an identical nucleotide sequence or in which nucleotides they
differ (see, for example, Cotton et al. (1988) Proc. Natl. Acad. Sci. USA
85:4397; Saleeba et al. (1992) Methods Enzymol. 217:286-295). The
control or sample nucleic acid is labeled for detection.

g. Electrophoretic mobility alterations

In other embodiments, alteration in electrophoretic mobility is used
to identify the type of allelic variant in the COX6B gene, GPI-1 gene or
other gene associated with cardiovascular disease. For example,
single-strand conformation polymorphism (SSCP) may be used to detect
differences in electrophoretic mobility between mutant and wild type
nucleic acids (Orita et al. (1989) Proc. Natl. Acad. Sci. USA 86:2766, see
also Cotton (1993) Mutat Res 285:125-144; and Hayashi (1992) Genet
Anal Tech Appl 9:73-79). Single-stranded DNA fragments of sample and
control nucleic acids are denatured and allowed to renature. The
secondary structure of single-stranded nucleic acids varies according to
sequence; the resulting alteration in electrophoretic mobility enables the
detection of even a single base change. The DNA fragments may be
labeled or detected with labeled probes. The sensitivity of the assay may
be enhanced by using RNA (rather than DNA), in which the secondary
structure is more sensitive to a change in sequence. In another
embodiment, the subject method uses heteroduplex analysis to separate
double stranded heteroduplex molecules on the basis of changes in
electrophoretic mobility (Keen et al. (1991) Trends Genet 7:5).

h. Polyacrylamide Gel Electrophoresis

In yet another embodiment, the identity of an allelic variant of a
polymorphic region in the COX6B gene, GPI-1 gene or other gene
associated with cardiovascular disease is obtained by analyzing the
movement of a nucleic acid comprising the polymorphic region in
polyacrylamide gels containing a gradient of denaturant, i.e., using
denaturing gradient gel electrophoresis (DGGE) (Myers et al. (1985)
Nature 313:495). When DGGE is used as the method of analysis, DNA
will be modified to ensure that it does not completely denature, for
example by adding a GC clamp of approximately 40 bp of high-melting
GC-rich DNA by PCR. In a further embodiment, a temperature gradient is
used in place of a denaturing agent gradient to identify differences in the
mobility of control and sample DNA (Rosenbaum and Reissner (1987)
Biophys Chem 265:1275).
i. Oligonucleotide ligation assay (OLA)

In another embodiment, identification of the allelic variant is carried
out using an oligonucleotide ligation assay (OLA), as described, e.g., in
U.S. Patent No. 4,998,617 and in Landegren, U. et al., Science
241:1077-1080 (1988). The OLA protocol uses two oligonucleotides that
are designed to be capable of hybridizing to abutting sequences of a
single strand of a target. One of the oligonucleotides is linked to a
separation marker, e.g., biotinylated, and the other is detectably labeled.
If the precise complementary sequence is found in a target molecule, the
oligonucleotides will hybridize such that their termini abut, and create a
ligation substrate. Ligation then permits the labeled oligonucleotide to be
recovered using avidin, or another biotin ligand. Nickerson, D. A. et al.
have described a nucleic acid detection assay that combines attributes of
PCR and OLA (Nickerson, D. A. et al., Proc. Natl. Acad. Sci. (U.S.A.)
87:8923-8927 (1990)). In this method, PCR is used to achieve the
exponential amplification of target DNA, which is then detected using
OLA.
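The logic of the OLA can be illustrated with a simplified sketch in which ligation is treated as occurring only when both oligonucleotides are perfectly complementary to abutting stretches of the target. The function names are hypothetical, and the model ignores the partial tolerance of real ligases for mismatches away from the junction.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def ola_ligates(target, upstream_oligo, downstream_oligo):
    """Idealized OLA: both oligonucleotides must match abutting target stretches.

    All sequences are written 5'->3'. The joined product
    (upstream_oligo + downstream_oligo) must be the reverse complement of a
    contiguous stretch of the single-stranded target.
    """
    return reverse_complement(upstream_oligo + downstream_oligo) in target
```

When the function returns True, the labeled oligonucleotide would be covalently joined to the biotinylated one and could be recovered with avidin; a single-base difference at the junction leaves the two oligonucleotides unligated in this model.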

Several techniques based on this OLA method have been developed
and can be used to detect specific allelic variants of a polymorphic region
of a gene. For example, U.S. Pat. No. 5,593,826 discloses an OLA using
an oligonucleotide having a 3'-amino group and a 5'-phosphorylated
oligonucleotide to form a conjugate having a phosphoramidate linkage. In
another variation of OLA (Tobe et al. (1996) Nucl. Acids Res.
24:3728), OLA combined with PCR permits typing of two alleles in a
single microtiter well. By marking each of the allele-specific primers with a
unique hapten, i.e., digoxigenin and fluorescein, each OLA reaction can be
detected by using hapten specific antibodies that are labeled with
different enzyme reporters, alkaline phosphatase or horseradish
peroxidase. This system permits the detection of the two alleles using a
high throughput format that leads to the production of two different
colors.

j. SNP detection methods 

Also provided are methods for detecting single nucleotide
polymorphisms. Because single nucleotide polymorphisms constitute sites
of variation flanked by regions of invariant sequence, their analysis
requires no more than the determination of the identity of the single
nucleotide present at the site of variation and it is unnecessary to
determine a complete gene sequence for each patient. Several methods
have been developed to facilitate the analysis of such single nucleotide
polymorphisms.

In one embodiment, the single base polymorphism can be detected
by using a specialized exonuclease-resistant nucleotide, as disclosed,
e.g., in Mundy, C. R. (U.S. Patent No. 4,656,127). According to the
method, a primer complementary to the allelic sequence immediately 3' to
the polymorphic site is permitted to hybridize to a target molecule
obtained from a particular animal or human. If the polymorphic site on the
target molecule contains a nucleotide that is complementary to the
particular exonuclease-resistant nucleotide derivative present, then that
derivative will be incorporated onto the end of the hybridized primer.
Such incorporation renders the primer resistant to exonuclease, and
thereby permits its detection. Since the identity of the
exonuclease-resistant derivative of the sample is known, a finding that
the primer has become resistant to exonucleases reveals that the
nucleotide present in the polymorphic site of the target molecule was
complementary to that of the nucleotide derivative used in the reaction.
This method has the advantage that it does not require the determination
of large amounts of extraneous sequence data.
In another embodiment, a solution-based method for determining
the identity of the nucleotide of a polymorphic site is employed (Cohen,
D. et al. (French Patent 2,650,840; PCT Application No. WO 91/02087)).
As in the Mundy method of U.S. Patent No. 4,656,127, a primer is
employed that is complementary to allelic sequences immediately 3' to a
polymorphic site. The method determines the identity of the nucleotide of
that site using labeled dideoxynucleotide derivatives, which, if
complementary to the nucleotide of the polymorphic site, will become
incorporated onto the terminus of the primer.
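The single-base extension principle common to the two methods above can be sketched as follows. The primer anneals to the template bases immediately 3' of the polymorphic site, and the terminator that is incorporated is the complement of the base at that site. The function names and index convention (0-based position on a template written 5'->3') are assumptions of the sketch.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def design_primer(template, site, length=18):
    """Primer complementary to the template bases immediately 3' of `site`."""
    return reverse_complement(template[site + 1:site + 1 + length])


def incorporated_terminator(template, primer, site):
    """Return the terminator base added to the primer's 3' end.

    Returns None if the primer does not anneal directly 3' of the site.
    """
    footprint = template[site + 1:site + 1 + len(primer)]
    if reverse_complement(primer) != footprint:
        return None
    return template[site].translate(_COMPLEMENT)
```

For a template carrying G at the polymorphic site the sketch reports incorporation of a C terminator, and for an A allele a T terminator, so the label on the extended primer identifies the allele.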

k. Genetic Bit Analysis

An alternative method, known as Genetic Bit Analysis or GBA™, is
described by Goelet, et al. (U.S. Patent No. 6,004,744, PCT Application
No. 92/15712). The method of Goelet, et al. uses mixtures of labeled
terminators and a primer that is complementary to the sequence 3' to a
polymorphic site. The labeled terminator that is incorporated is thus
determined by, and complementary to, the nucleotide present in the
polymorphic site of the target molecule being evaluated. In contrast to
the method of Cohen et al. (French Patent 2,650,840; PCT Application
No. WO 91/02087), the method of Goelet, et al. is typically a
heterogeneous phase assay, in which the primer or the target molecule is
immobilized to a solid phase.

l. Other primer-guided nucleotide incorporation
procedures

Other primer-guided nucleotide incorporation procedures for
assaying polymorphic sites in DNA have been described (Komher, J. S. et
al., Nucl. Acids Res. 17:7779-7784 (1989); Sokolov, B. P., Nucl. Acids
Res. 18:3671 (1990); Syvanen, A. C., et al., Genomics 8:684-692
(1990); Kuppuswamy, M. N. et al., Proc. Natl. Acad. Sci. (U.S.A.)
88:1143-1147 (1991); Prezant, T. R. et al., Hum. Mutat. 1:159-164
(1992); Ugozzoli, L. et al., GATA 9:107-112 (1992); Nyren, P. et al.,
Anal. Biochem. 208:171-175 (1993)). These methods differ from GBA™
in that they all rely on the incorporation of labeled deoxynucleotides to
discriminate between bases at a polymorphic site. In such a format, since
the signal is proportional to the number of deoxynucleotides incorporated,
polymorphisms that occur in runs of the same nucleotide can result in
signals that are proportional to the length of the run (Syvanen, A. C., et
al., Amer. J. Hum. Genet. 52:46-59 (1993)).
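The dependence of signal on run length can be made concrete with a small sketch that counts how many labeled deoxynucleotides of a single type are added before the template calls for a different base. The function name and the convention that the template is given in the order read by the polymerase are assumptions of the sketch.

```python
_PAIR = {"A": "T", "C": "G", "G": "C", "T": "A"}


def incorporation_count(template_ahead, labeled_dntp):
    """Count labeled dNTPs added when only one dNTP type is supplied.

    template_ahead: template bases beyond the primer's 3' end, listed in the
    order in which the polymerase reads them.
    """
    count = 0
    for base in template_ahead:
        if _PAIR[base] != labeled_dntp:
            break
        count += 1
    return count


# A run of three T's in the template gives three labeled A's, not one.
print(incorporation_count("TTTGCA", "A"))
```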

For determining the identity of the allelic variant of a polymorphic
region located in the coding region of a gene, yet other methods than
those described above can be used. For example, identification of an
allelic variant that encodes a mutated protein can be performed by using
an antibody specifically recognizing the mutant protein in, e.g.,
immunohistochemistry or immunoprecipitation. Binding assays are
known in the art and involve, e.g., obtaining cells from a subject, and
performing binding experiments with a labeled lipid, to determine whether
binding to the mutated form of the protein differs from binding to the
wild-type protein.

m. Molecular structure determination 

If a polymorphic region is located in an exon, either in a coding or
non-coding region of the gene, the identity of the allelic variant can be
determined by determining the molecular structure of the mRNA,
pre-mRNA, or cDNA. The molecular structure can be determined using
any of the above described methods for determining the molecular
structure of the genomic DNA, e.g., sequencing and SSCP.
n. Mass spectrometric methods

Nucleic acids also can be analyzed by detection methods and
protocols, particularly those that rely on mass spectrometry (see, e.g.,
U.S. Patent No. 5,605,798, allowed co-pending U.S. Application Serial
No. 08/617,256, allowed co-pending U.S. Application Serial No.
08/744,481, U.S. Application Serial No. 08/990,851, International PCT
Application No. WO 98/20019). These methods can be automated (see,
e.g., co-pending U.S. Application Serial No. 09/285,481, which describes
an automated process line). Among the methods of analysis herein are
those involving the primer oligo base extension (PROBE) reaction with
mass spectrometry for detection (described herein and elsewhere, see
e.g., U.S. Application Serial Nos. 08/617,256, 09/287,681, 09/287,682,
09/287,141 and 09/287,679, allowed co-pending U.S. Application Serial
No. 08/744,481, International PCT Application No. PCT/US97/20444,
published as International PCT Application No. WO 98/20019, and based
upon U.S. Application Serial Nos. 08/744,481, 08/744,590, 08/746,036,
08/746,055, 08/786,988, 08/787,639, 08/933,792, 08/746,055,
08/786,988 and 08/787,639; see, also U.S. Application Serial No.
09/074,936, allowed U.S. Application Serial No. 08/787,639, and U.S.
Application Serial Nos. 08/746,055 and 08/786,988, and published
International PCT Application No. WO 98/20020).

One format for performing the analyses is a chip based format in
which the biopolymer is linked to a solid support, such as a silicon or
silicon-coated substrate, typically in the form of an addressable array.
Typically when analyses are performed using mass spectrometry,
particularly MALDI, nanoliter volumes of sample are loaded on, such that
the resulting spot is about, or smaller than, the size of the laser spot. It
has been found that when this is achieved, the results from the mass
spectrometric analysis are quantitative. The area under the peaks in the
resulting mass spectra is proportional to concentration (when normalized
and corrected for background). Methods for preparing and using such
chips are described in allowed co-pending U.S. Application Serial No.
08/787,639, co-pending U.S. Application Serial Nos. 08/786,988,
09/364,774, 09/371,150 and 09/297,575; see, also U.S. Application
Serial No. PCT/US97/20195, which published as International PCT
Application No. WO 98/20020. Chips and kits for performing these
analyses are commercially available from SEQUENOM under the
trademark MassARRAY™. MassARRAY™ relies on the fidelity of the
enzymatic primer extension reactions combined with the miniaturized
array and MALDI-TOF (Matrix-Assisted Laser Desorption Ionization-Time
of Flight) mass spectrometry to deliver results rapidly. It accurately
distinguishes single base changes in the size of DNA fragments relating to
genetic variants without tags.
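Because peak area is proportional to concentration once normalized and corrected for background, relative allele amounts can be estimated from the areas of the allele-specific peaks. The following is a minimal sketch under that assumption; the function name, the use of a single constant background value and the example areas are hypothetical.

```python
def allele_fractions(peak_areas, background=0.0):
    """Convert allele-specific peak areas into relative fractions.

    peak_areas: mapping of allele label to integrated peak area.
    background: constant area subtracted from every peak (an assumption;
    real spectra may need a per-peak baseline).
    """
    corrected = {
        allele: max(area - background, 0.0)
        for allele, area in peak_areas.items()
    }
    total = sum(corrected.values())
    if total == 0:
        raise ValueError("no signal above background")
    return {allele: area / total for allele, area in corrected.items()}


# Hypothetical areas for the two extension products of a C/T polymorphism.
print(allele_fractions({"C": 310.0, "T": 110.0}, background=10.0))
```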

Multiplex methods allow for the simultaneous detection of more
than one polymorphic region in a particular gene or polymorphic regions in
several genes. This is one exemplary method for carrying out
haplotype analysis of allelic variants of the COX6B and/or GPI-1 genes
separately, or along with allelic variants of one or more other genes
associated with cardiovascular disease.

Multiplexing can be achieved by several different methodologies.
For example, several mutations can be simultaneously detected on one
target sequence by employing corresponding detector (probe) molecules
(e.g., oligonucleotides or oligonucleotide mimetics). The molecular weight
differences between the detector oligonucleotides must be large enough
so that simultaneous detection (multiplexing) is possible. This can be
achieved either by the sequence itself (composition or length) or by the
introduction of mass-modifying functionalities into the detector
oligonucleotides (see below).
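The requirement that detector oligonucleotides differ sufficiently in mass can be checked with a sketch such as the following. The residue masses are commonly used approximate average values for unmodified DNA, the minimum resolvable difference `min_delta` is a hypothetical parameter that depends on the instrument, and the detector sequences are invented for the example.

```python
from itertools import combinations

# Approximate average masses (Da) of DNA nucleotide residues (assumed values).
RESIDUE_MASS = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}
TERMINAL_CORRECTION = -61.96  # unphosphorylated 5' end, free 3' hydroxyl


def oligo_mass(sequence, modification=0.0):
    """Approximate mass of an oligonucleotide plus any mass-modifying group."""
    return (sum(RESIDUE_MASS[base] for base in sequence)
            + TERMINAL_CORRECTION + modification)


def unresolved_pairs(detectors, min_delta):
    """Return pairs of detector names whose masses differ by < min_delta.

    detectors: mapping of name -> (sequence, mass of modification in Da).
    """
    masses = {name: oligo_mass(seq, mod)
              for name, (seq, mod) in detectors.items()}
    return [
        (first, second)
        for first, second in combinations(sorted(masses), 2)
        if abs(masses[first] - masses[second]) < min_delta
    ]
```

Two detectors of identical base composition cannot be separated by sequence alone; adding a mass-modifying functionality to one of them (for example a 45 Da increment) removes the pair from the unresolved list.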
Mass modifying moieties can be attached, for instance, to either
the 5'-end of the oligonucleotide, to the nucleobase (or bases), to the
phosphate backbone, and to the 2'-position of the nucleoside
(nucleosides) and/or to the terminal 3'-position. Examples of mass
modifying moieties include, for example, a halogen, an azido, or of the
type, XR, wherein X is a linking group and R is a mass-modifying
functionality. The mass-modifying functionality can thus be used to
introduce defined mass increments into the oligonucleotide molecule.

The mass-modifying functionality can be located at different
positions within the nucleotide moiety (see, e.g., U.S. Patent No.
5,547,835 and International PCT Application No. WO 94/21822). For
example, the mass-modifying moiety, M, can be attached either to the
nucleobase (in case of the 7-deazanucleosides also to C-7), to the
triphosphate group at the alpha phosphate or to the 2'-position of the
sugar ring of the nucleoside triphosphate. Modifications introduced at the
phosphodiester bond, such as with alpha-thio nucleoside triphosphates,
have the advantage that these modifications do not interfere with
accurate Watson-Crick base-pairing and additionally allow for the one-step
post-synthetic site-specific modification of the complete nucleic acid
molecule, e.g., via alkylation reactions (see, e.g., Nakamaye et al. (1988)
Nucl. Acids Res. 16:9947-59). Exemplary mass-modifying functionalities
are boron-modified nucleic acids since they are better incorporated into
nucleic acids by polymerases (see, e.g., Porter et al. (1995) Biochemistry
34:11963-11969; Hasan et al. (1996) Nucleic Acids Res. 24:2150-2157;
Li et al. (1995) Nucl. Acids Res. 23:4495-4501).

Furthermore, the mass-modifying functionality can be added so as
to effect chain termination, such as by attaching it to the 3'-position of
the sugar ring in the nucleoside triphosphate. For those skilled in the art,
it is clear that many combinations can be used in the methods provided
herein. In the same way, those skilled in the art will recognize that
chain-elongating nucleoside triphosphates also can be mass-modified in a
similar fashion with numerous variations and combinations in functionality
and attachment positions.

For example, without being bound to any particular theory, the
mass-modification can be introduced for X in XR as well as using
oligo-/polyethylene glycol derivatives for R. The mass-modifying
increment in this case is 44 per unit (m), i.e., five different mass-modified
species can be generated by just changing m from 0 to 4, thus adding mass
units of 45 (m = 0), 89 (m = 1), 133 (m = 2), 177 (m = 3) and 221 (m = 4)
to the nucleic acid molecule (e.g., detector oligonucleotide (D) or the
nucleoside triphosphates, respectively). The oligo/polyethylene glycols
also can be monoalkylated by a lower alkyl such as, but not limited to,
methyl, ethyl, propyl, isopropyl and t-butyl. Other chemistries can be used
in the mass-modified compounds (see, e.g., those described in
Oligonucleotides and Analogues, A Practical Approach, F. Eckstein, editor,
IRL Press, Oxford, 1991).
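The series of mass increments recited above follows a simple linear rule, 45 + 44m, which can be reproduced with the short sketch below. The function and parameter names are hypothetical; the numerical values are those given in the text.

```python
def peg_mass_increment(m, base=45, unit=44):
    """Mass added by an oligo-/polyethylene glycol group with m repeat units.

    base: mass added at m = 0; unit: increment per additional repeat unit.
    """
    return base + unit * m


# Five mass-modified species, m = 0 to 4.
print([peg_mass_increment(m) for m in range(5)])
```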

In yet another embodiment, various mass-modifying functionalities,
R, other than oligo/polyethylene glycols, can be selected and attached via
appropriate linking chemistries, X. A simple mass-modification can be
achieved by replacing H with halogens, such as F, Cl, Br and/or I, or
pseudohalogens such as CN, SCN, NCS, or by using different alkyl, aryl
or aralkyl moieties such as methyl, ethyl, propyl, isopropyl, t-butyl, hexyl,
phenyl, substituted phenyl, benzyl, or functional groups such as CH2F,
CHF2, CF3, Si(CH3)3, Si(CH3)2(C2H5), Si(CH3)(C2H5)2, Si(C2H5)3. Yet
another mass-modification can be obtained by attaching homo- or
heteropeptides through the nucleic acid molecule (e.g., detector (D)) or
nucleoside triphosphates. One example, useful in generating
mass-modified species with a mass increment of 57, is the attachment of
oligoglycines (m) to nucleic acid molecules (r), e.g., mass-modifications of
74 (r = 1, m = 0), 131 (r = 1, m = 1), 188 (r = 1, m = 2), 245 (r = 1, m = 3)
are achieved. Simple oligoamides also can be used, e.g., but not limited
to, mass-modifications of 74 (r = 1, m = 0), 88 (r = 2, m = 0), 102 (r = 3,
m = 0), 116 (r = 4, m = 0) are obtainable. Variations in addition to those
set forth herein will be apparent to the skilled artisan.
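The two series recited above are likewise linear: the oligoglycine series adds 57 per glycine unit (m) at r = 1, and the oligoamide series adds 14 per step in r at m = 0. A minimal sketch reproducing the stated values follows; the function names are hypothetical, and no claim is made about values for combinations of r and m not recited in the text.

```python
def oligoglycine_increment(m):
    """Mass-modification for r = 1 with m glycine units (57 per unit)."""
    return 74 + 57 * m


def oligoamide_increment(r):
    """Mass-modification for m = 0 with chain parameter r (14 per step)."""
    return 74 + 14 * (r - 1)


print([oligoglycine_increment(m) for m in range(4)])
print([oligoamide_increment(r) for r in range(1, 5)])
```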

Different mass-modified detector oligonucleotides can be used to
detect all possible variants/mutants simultaneously.
Alternatively, all four base permutations at the site of a mutation can be
detected by designing and positioning a detector oligonucleotide, so that
it serves as a primer for a DNA/RNA polymerase with varying
combinations of elongating and terminating nucleoside triphosphates. For
example, mass modifications also can be incorporated during the
amplification process.

A different multiplex detection format is one in which differentiation
is accomplished by employing different specific capture sequences that
are position-specifically immobilized on a flat surface (e.g., a 'chip array').
If different target sequences T1-Tn are present, their target capture sites
TCS1-TCSn will specifically interact with complementary immobilized
capture sequences C1-Cn. Detection is achieved by employing
appropriately mass differentiated detector oligonucleotides D1-Dn, which
carry mass modifying functionalities M1-Mn.

o. Other methods 
Additional methods of analyzing nucleic acids include
amplification-based methods including polymerase chain reaction (PCR),
ligase chain reaction (LCR), mini-PCR, rolling circle amplification,
autocatalytic methods, such as those using Qβ replicase, TAS, 3SR, and
any other suitable method known to those of skill in the art.
Other methods for analysis and identification and detection of
polymorphisms include, but are not limited to, allele specific probes,
Southern analyses, and other such analyses.
2. Primers and probes 
Primers refer to nucleic acids that are capable of specifically
hybridizing to a nucleic acid sequence that is adjacent to a polymorphic
region of interest or to a polymorphic region and are extended. A primer
can be used alone in a detection method, or a primer can be used
together with at least one other primer or probe in a detection method.
Primers also can be used to amplify at least a portion of a nucleic acid.
For amplifying at least a portion of a nucleic acid, a forward primer (i.e.,
5' primer) and a reverse primer (i.e., 3' primer) typically will be used.
Forward and reverse primers hybridize to complementary strands of a
double stranded nucleic acid, such that upon extension from each primer,
a double stranded nucleic acid is amplified.
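The relationship between a forward primer, a reverse primer and the amplified region can be sketched as follows. The forward primer is identical to the start of the region on the strand shown, and the reverse primer is the reverse complement of its end, so that the two primers anneal to complementary strands. The function names and coordinates are hypothetical, and the sketch assumes that each primer site occurs only once in the template.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def primer_pair(template, start, end, length=20):
    """Forward and reverse primers bracketing template[start:end]."""
    forward = template[start:start + length]
    reverse = reverse_complement(template[end - length:end])
    return forward, reverse


def amplicon(template, forward, reverse):
    """Region amplified by the pair, or None if either primer site is absent.

    Uses the first occurrence of each site on the strand shown.
    """
    left = template.find(forward)
    right = template.find(reverse_complement(reverse))
    if left == -1 or right == -1 or right < left:
        return None
    return template[left:right + len(reverse)]
```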

Probes refer to nucleic acids that hybridize to the region of interest
and that are not further extended. For example, a probe is a nucleic acid
that hybridizes adjacent to or at a polymorphic region of a COX6B gene, a
GPI-1 gene or another gene associated with cardiovascular disease and
that by hybridization or absence of hybridization to the DNA of a subject
will be indicative of the identity of the allelic variant of the polymorphic
region of the gene. Exemplary probes have a number of nucleotides
sufficient to allow specific hybridization to the target nucleotide
sequence. Where the target nucleotide sequence is present in a large
fragment of DNA, such as a genomic DNA fragment of several tens or
hundreds of kilobases, the size of a probe may have to be longer to
provide sufficiently specific hybridization, as compared to a probe that is
used to detect a target sequence that is present in a shorter fragment of
DNA. For example, in some diagnostic methods, a portion of a COX6B
gene, a GPI-1 gene or another gene associated with cardiovascular
disease may first be amplified and thus isolated from the rest of the
chromosomal DNA and then hybridized to a probe. In such a situation, a
shorter probe will likely provide sufficient specificity of hybridization. For
example, a probe having a nucleotide sequence of about 10 nucleotides
may be sufficient.

Exemplary primers and probes hybridize adjacent to or at the
polymorphic sites described in TABLES 1-3. In addition, primers include
SEQ ID NOS.: 5, 10, 43, 48, 53, 58, 63, 68, 73, 78, 83, 88, 93, 98,
103, 108, 113, and 118.

Primers and probes (RNA, DNA (single-stranded or
double-stranded), PNA and their analogs) described herein may be labeled
with any detectable reporter or signal moiety including, but not limited to,
radioisotopes, enzymes, antigens, antibodies, spectrophotometric
reagents, chemiluminescent reagents, fluorescent reagents and any other
light producing chemicals. Additionally, these probes may be modified
without changing the substance of their purpose by terminal addition of
nucleotides designed to incorporate restriction sites or other useful
sequences, proteins, signal generating ligands such as acridinium esters,
and/or paramagnetic particles.

These probes may also be modified by the addition of a capture
moiety (including, but not limited to, paramagnetic particles, biotin,
fluorescein, digoxigenin, antigens, antibodies) or attached to the walls of
microtiter trays to assist in the solid phase capture and purification of
these probes and any DNA or RNA hybridized to these probes.
Fluorescein may be used as a signal moiety as well as a capture moiety,
the latter by interacting with an anti-fluorescein antibody.

Any probe or primer can be prepared according to methods well
known in the art and described, e.g., in Sambrook, J., Fritsch, E.F., and
Maniatis, T. (1989) Molecular Cloning: A Laboratory Manual, Cold Spring
Harbor Laboratory Press, Cold Spring Harbor, N.Y. For example, discrete
fragments of the DNA can be prepared and cloned using restriction
enzymes. Alternatively, probes and primers can be prepared using the
Polymerase Chain Reaction (PCR) using primers having an appropriate
sequence.

Oligonucleotides may be synthesized by standard methods known
in the art, e.g., by use of an automated DNA synthesizer (such as are
commercially available from numerous sources, such as Biosearch
(Novato, CA) and Applied Biosystems (Foster City, CA)). As examples,
phosphorothioate oligonucleotides may be synthesized by the method of
Stein et al. ((1988) Nucl. Acids Res. 16:3209), methylphosphonate
oligonucleotides can be prepared by use of controlled pore glass polymer
supports (Sarin et al., 1988, Proc. Natl. Acad. Sci. U.S.A.
85:7448-7451), and others.

H. Transgenic Animals 

Methods for making transgenic animals using a variety of
transgenes are known (see, e.g., Wagner et al. (1981) Proc. Nat. Acad.
Sc. U.S.A. 78:5016; Stewart et al. (1982) Science 217:1046;
Constantini et al. (1981) Nature 294:92; Lacy et al. (1982) Cell 34:343;
McKnight et al. (1983) Cell 34:335; Brinster et al. (1983) Nature
306:332; Palmiter et al. (1982) Nature 300:611; Palmiter et al. (1982)
Cell 29:701 and Palmiter et al. (1983) Science 222:809; and U.S. Patent
Nos. 6,175,057; 6,180,849 and 6,133,502).

Transgenic animals contain an exogenous nucleic acid sequence
present as an extrachromosomal element or stably integrated in all or a
portion of their cells, especially germ cells. Unless otherwise indicated, it
will be assumed that a transgenic animal contains stable changes to the
germline sequence. During the initial construction of the animal,
"chimeras" or "chimeric animals" are generated, in which only a subset of
cells have the altered genome. Chimeras are primarily used for breeding
purposes in order to generate the desired transgenic animal. Animals
having a heterozygous alteration are generated by breeding of chimeras.
Male and female heterozygotes are typically bred to generate homozygous
animals.
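The expected outcome of breeding heterozygotes can be illustrated with a simple sketch that enumerates the allele combinations of a cross. It assumes simple Mendelian segregation of a single locus with equal transmission and viability of all genotypes; the allele symbols and function name are hypothetical.

```python
from collections import Counter
from itertools import product


def offspring_genotypes(parent_1, parent_2):
    """Expected genotype fractions from a single-locus cross.

    Each parent is given as a two-character string of alleles, e.g. "Tt",
    where "T" denotes the transgene allele and "t" the unaltered allele.
    """
    counts = Counter(
        "".join(sorted(pair)) for pair in product(parent_1, parent_2)
    )
    total = sum(counts.values())
    return {genotype: n / total for genotype, n in counts.items()}


# Heterozygote x heterozygote cross.
print(offspring_genotypes("Tt", "Tt"))
```

Under these assumptions about one quarter of the offspring of a heterozygote cross would be homozygous for the altered allele.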

The exogenous gene is usually either from a different species than
the animal host, or is otherwise altered in its coding or non-coding
sequence. The introduced gene may be a wild-type gene, naturally
occurring polymorphism (e.g., as described for COX6B, GPI-1 and other
genes associated with cardiovascular disease) or a genetically
manipulated sequence, for example having deletions, substitutions or
insertions in the coding or non-coding regions. When the introduced gene
is a coding sequence, it is usually operably linked to a promoter, which
may be constitutive or inducible, and other regulatory sequences required
for expression in the host animal.

Transgenic animals can contain other genetic alterations in addition
to the presence of alleles of COX6B and/or GPI-1 genes. For example,
the genome can be altered to affect the function of the endogenous
genes, contain marker genes, or contain other genetic alterations (e.g.,
alleles of other genes associated with cardiovascular disease).

A "knock-out" of a gene means an alteration in the sequence of the
gene that results in a decrease of function of the target gene, typically
such that target gene expression is undetectable or insignificant. A
knock-out of an endogenous COX6B or GPI-1 gene means that function
of the gene has been substantially decreased so that expression is not
detectable or only present at insignificant levels. "Knock-out" transgenics
can be transgenic animals having a heterozygous knock-out of the COX6B
or GPI-1 gene or a homozygous knock-out of one or both of these genes.
"Knock-outs" also include conditional knock-outs, where alteration of the
target gene can occur upon, for example, exposure of the animal to a
substance that promotes target gene alteration, introduction of an enzyme
that promotes recombination at the target gene site (e.g., Cre in the
Cre-lox system), or other method for directing the target gene alteration
postnatally.

A "knock-in" of a target gene means an alteration in a host cell
genome that results in altered expression (e.g., increased (including
ectopic)) of the target gene, e.g., by introduction of an additional copy of
the target gene, or by operatively inserting a regulatory sequence that
provides for enhanced expression of an endogenous copy of the target
gene. "Knock-in" transgenics of interest can be transgenic animals having
a knock-in of the COX6B or GPI-1 gene. Such transgenics can be
heterozygous or homozygous for the knock-in gene. "Knock-ins" also
encompass conditional knock-ins.

A construct is suitable for use in the generation of transgenic
animals if it allows the desired level of expression of a COX6B or GPI-1
encoding sequence or the encoding sequence of another gene associated
with cardiovascular disease. Methods of isolating and cloning a desired
sequence, as well as suitable constructs for expression of a selected
sequence in a host animal, are well known in the art and are described
below.

For the introduction of a gene into the subject animal, it is generally
advantageous to use the gene as a gene construct wherein the gene is
ligated downstream of, and operably linked to, a promoter capable of
expressing the gene in the cells of the subject animal. Specifically, a
transgenic non-human mammal showing high expression of the desired gene
can be created by microinjecting, into a fertilized egg of the subject
non-human mammal (e.g., a rat fertilized egg), a vector in which said
gene is ligated downstream of any of various promoters capable of
expressing the protein and/or the corresponding protein derived from
various mammals (rabbits, dogs, cats, guinea pigs, hamsters, rats, mice
and other mammals).

Useful vectors include Escherichia coli-derived plasmids, Bacillus
subtilis-derived plasmids, yeast-derived plasmids, bacteriophages such as
lambda phage, retroviruses such as Moloney leukemia virus, and animal
viruses such as vaccinia virus or baculovirus.

Useful promoters for such gene expression regulation include, for
example, promoters for genes derived from viruses (cytomegalovirus,
Moloney leukemia virus, JC virus, breast cancer virus and others), and
promoters for genes derived from various mammals (humans, rabbits,
dogs, cats, guinea pigs, hamsters, rats, mice and other such mammalian
species) and birds such as, but not limited to, chickens (e.g., genes
for albumin, insulin II, erythropoietin, endothelin, osteocalcin, muscular
creatine kinase, platelet-derived growth factor beta, keratins K1, K10 and
K14, collagen types I and II, atrial natriuretic factor, dopamine
beta-hydroxylase, endothelial receptor tyrosine kinase (generally
abbreviated Tie2), sodium-potassium adenosine triphosphatase (generally
abbreviated Na,K-ATPase), neurofilament light chain, metallothioneins I
and IIA, metalloproteinase I tissue inhibitor, MHC class I antigen
(generally abbreviated H-2L), smooth muscle alpha actin, polypeptide
chain elongation factor 1 alpha (EF-1 alpha), beta actin, alpha and beta
myosin heavy chains, myosin light chains 1 and 2, myelin basic protein,
serum amyloid component, myoglobin, renin and other such proteins).

The above-mentioned vectors can include a sequence for
terminating the transcription of the desired messenger RNA in the
transgenic animal (generally referred to as terminator); for example, gene
expression can be manipulated using a sequence with such function
contained in various genes derived from viruses, mammals and birds. The
simian virus SV40 terminator is a commonly used exemplary terminator.
Additionally, for the purpose of increasing the expression of the desired
gene, the splicing signal and enhancer region of each gene, or a portion
of the intron of a eukaryotic gene, may be ligated 5' upstream of
the promoter region, or between the promoter region and the translational
region, or 3' downstream of the translational region as desired.

A translational region for a protein of interest can be obtained using
all or a portion of genomic DNA of blood, kidney or fibroblast origin
from various mammals (humans, rabbits, dogs, cats, guinea pigs,
hamsters, rats, mice and others) or of various commercially available
genomic DNA libraries, as a starting material, or using complementary
DNA prepared by a known method from RNA of blood, kidney or
fibroblast origin as a starting material. Also, an exogenous gene can be
obtained using complementary DNA prepared by a known method from
RNA of human fibroblast origin as a starting material. All these
translational regions can be used in transgenic animals.

To obtain the translational region, it is possible to prepare DNA
incorporating an exogenous gene encoding the protein of interest in which
the gene is ligated downstream of the above-mentioned promoter
(generally upstream of the translation termination site) as a gene
construct capable of being expressed in the transgenic animal.

DNA constructs for random integration need not include regions of
homology to mediate recombination. Where homologous recombination is
desired, the DNA constructs contain at least a portion of the target gene
with the desired genetic modification, and include regions of homology to
the target locus. Conveniently, markers for positive and negative
selection are included. Methods for generating cells having targeted gene
modifications through homologous recombination are known in the art.
For various techniques for transfecting mammalian cells, see Keown et al.
(1990) Methods in Enzymology 185:527-537.

The transgenic animal can be created by introducing a COX6B or
GPI-1 gene construct into, for example, an unfertilized egg, a fertilized
egg, a spermatozoon or a germinal cell containing a primordial germinal
cell thereof, generally in the embryogenic stage in the development of a
non-human mammal (typically in the single-cell or fertilized cell stage and
generally before the 8-cell phase), by standard means, such as the
calcium phosphate method, the electric pulse method, the lipofection
method, the agglutination method, the microinjection method, the particle
gun method, the DEAE-dextran method and other such methods. Also, it
is possible to introduce a desired COX6B or GPI-1 gene into, for
example, a somatic cell, a living organ or a tissue cell by gene
transformation methods, and use it for cell culture, tissue culture and
other such uses. Furthermore, these cells may be fused with the
above-described germinal cell by a commonly known cell fusion method to
create a transgenic animal.

For embryonic stem (ES) cells, an ES cell line may be employed, or
embryonic cells may be obtained freshly from a host, e.g., mouse, rat,
guinea pig, and other mammals and birds. Such cells are grown on an
appropriate fibroblast-feeder layer or grown in the presence of appropriate
growth factors, such as leukemia inhibitory factor (LIF). When ES cells
have been transformed, they may be used to produce transgenic animals.
After transformation, the cells are plated onto a feeder layer in an
appropriate medium. Cells containing the construct may be detected by
employing a selective medium. After sufficient time for colonies to grow,
they are picked and analyzed for the occurrence of homologous
recombination or integration of the construct. Those colonies that are
positive may then be used for embryo manipulation and blastocyst
injection. Blastocysts are obtained from 4 to 6 week old superovulated
females. The ES cells are trypsinized, and the modified cells are injected
into the blastocoel of the blastocyst. After injection, the blastocysts are
returned to each uterine horn of pseudopregnant females. Females are
then allowed to go to term and the resulting litters screened for mutant
cells having the construct. By providing for a different phenotype of the
blastocyst and the ES cells, chimeric progeny can be readily detected.
The chimeric animals are screened for the presence of the modified gene
and males and females having the modification are mated to produce
homozygous progeny. If the gene alterations cause lethality at some
point in development, tissues or organs can be maintained as allogeneic
or congenic grafts or transplants, or in in vitro culture.

Animals containing more than one transgene, such as allelic
variants of COX6B and/or GPI-1 and/or other genes associated with
cardiovascular disease, can be made by sequentially introducing individual
alleles into an animal in order to produce the desired phenotype
(manifestation or predisposition to cardiovascular disease).

I. Effect of Allelic Variants on the Encoded Protein and Disease 
Related Phenotype 

The effect of an allelic variant on a COX6B or GPI-1 protein (altered
amount, stability, location and/or activity) can be determined according to
methods known in the art. Allelic variants of the COX6B and GPI-1 genes
can be assayed individually or in combination with other variants known
to be associated with cardiovascular disease.

If the mutation is located in an intron, the effect of the mutation can be
determined, e.g., by producing transgenic animals in which the allelic
variant linked to lipid metabolism and/or cardiovascular disease has been
introduced and in which the wild-type gene or predominant allele may
have been knocked out. Comparison of the level of expression of the
protein in the mice transgenic for the allelic variant with mice transgenic
for the predominant allele will reveal whether the mutation results in
increased or decreased synthesis of the associated protein and/or aberrant
tissue distribution of the associated protein. Such analysis could also be
performed in cultured cells, in which the human variant allele gene is
introduced and, e.g., replaces the endogenous gene in the cell. Thus,
depending on the effect of the alteration a specific treatment can be
administered to a subject having such a mutation. Accordingly, if the
mutation results in decreased production of a COX6B or GPI-1 protein,
the subject can be treated by administration of a compound that increases
synthesis, such as by increasing COX6B or GPI-1 gene expression, and
wherein the compound acts at a regulatory element different from the one
that is mutated. Alternatively, if the mutation results in increased COX6B
or GPI-1 protein levels, the subject can be treated by administration of a
compound that reduces protein production, e.g., by reducing COX6B or
GPI-1 gene expression or a compound that inhibits or reduces the activity
of COX6B or GPI-1 protein.

J. Diagnostic and Prognostic Assays

Typically, an individual allelic variant that associates with a risk
factor for cardiovascular disease will not be used in isolation as a
prognosticator for a subject developing high cholesterol, low HDL or
cardiovascular disease. An allelic variant typically will be one of a
plurality of indicators that are used. The other indicators may be the
manifestation of other risk factors for cardiovascular disease, e.g., family
history, high blood pressure, weight, activity level and other indicators, or
additional allelic variants in the same or other genes associated with
altered lipid metabolism and/or cardiovascular disease.

Useful combinations of allelic variants of the COX6B gene and/or
the GPI-1 gene can be determined by examining combinations of variants
of these genes, which are assayed individually or assayed simultaneously
using multiplexing methods as described above or any other labelling
method that allows different variants to be identified. In particular,
variants of the COX6B gene and/or the GPI-1 gene may be assayed using
kits (see below) or any of a variety of microarrays known to those in the
art. For example, oligonucleotide probes comprising the polymorphic
regions surrounding any polymorphism in the COX6B or GPI-1 gene may be
designed and fabricated using methods such as those described in U.S.
Patent Nos. 5,492,806; 5,525,464; 5,695,940; 6,018,041; 6,025,136;
WO 98/30883; WO 98/56954; WO 99/09218; WO 00/58516;
WO 00/58519, or references cited therein. Similarly one of skill in the art
can determine useful combinations of allelic variants of the COX6B and/or
GPI-1 genes along with variants of other genes associated with
cardiovascular disease.

K. Pharmacogenomics

Subjects having one or more different allelic variants of the COX6B
or GPI-1 polymorphic regions will respond differently to therapeutic drugs
to treat cardiovascular disease or conditions. For example, there are
numerous drugs available for lowering cholesterol levels, including
lovastatin (MEVACOR; Merck & Co.), simvastatin (ZOCOR; Merck & Co.),
dextrothyroxine (CHOLOXIN; Knoll Pharmaceutical Co.), pamaqueside
(Pfizer), cholestyramine (QUESTRAN; Bristol-Myers Squibb), colestipol
(COLESTID; Pharmacia & Upjohn), acipimox (Pharmacia & Upjohn),
fenofibrate (LIPIDIL), gemfibrozil (LOPID; Warner-Lambert), cerivastatin
(LIPOBAY; Bayer), fluvastatin (LESCOL; Novartis), atorvastatin (LIPITOR;
Warner-Lambert), etofylline clofibrate (DUOLIP; Merckle (Germany)),
probucol (LORELCO; Hoechst Marion Roussel), omacor (Pronova
(Norway)), etofibrate (Merz (Germany)), clofibrate (ATROMID-S;
Wyeth-Ayerst (AHP)), and niacin (numerous manufacturers). Not all
patients respond identically to these drugs. Alleles of the COX6B or the
GPI-1 gene that associate with altered lipid metabolism will be useful
alone or in conjunction with markers in other genes associated with the
development of cardiovascular disease to predict a subject's response to
a therapeutic drug. For example, multiplex primer extension assays or
microarrays comprising probes for alleles are useful formats for
determining drug response. A correlation between drug responses and
specific alleles or combinations of alleles of the COX6B or GPI-1 genes
and other genes associated with cardiovascular disease can be shown, for
example, by clinical studies wherein the responses to specific drugs of
subjects having different allelic variants of polymorphic regions of the
COX6B or GPI-1 genes, alone or in combination with allelic variants of
other genes, are compared. Such studies also can be performed using
animal models, such as mice having various alleles and in which, e.g., the
endogenous COX6B or GPI-1 genes have been inactivated such as by a
knock-out mutation. Test drugs are then administered to the mice having
different alleles and the response of the different mice to a specific
compound is compared. Accordingly, assays, microarrays and kits are
provided for determining the drug that will be best suited for treating a
specific disease or condition in a subject based on the individual's
genotype. For example, it will be possible to select drugs that will be
devoid of toxicity, or have the lowest level of toxicity possible for
treating a subject having a disease or condition, e.g., cardiovascular
disease or high cholesterol or low HDL.

L. Kits

Kits can be used to indicate whether a subject is at risk of
developing high cholesterol, low HDL and/or cardiovascular disease. The
kits also can be used to determine if a subject who has high cholesterol or
low HDL carries associated variants in the COX6B or GPI-1 genes or other
cardiovascular disease-related genes. This information could be used,
e.g., to optimize treatment of such individuals as a particular genotype
may be associated with drug response.

In certain embodiments, the kits include a probe or primer that is
capable of hybridizing adjacent to or at a polymorphic region of a COX6B
or GPI-1 gene and thereby identifying whether the COX6B or GPI-1 gene
contains an allelic variant that is associated with cardiovascular disease.
Primers or probes that specifically hybridize at or adjacent to the SNPs
described in Tables 1-3 could be included. In particular, primers or probes
that contain the sequences of SEQ ID NOs.: 5, 10, 43, 48, 53, 58, 63, 68,
73, 78, 83, 88, 93, 98, 103, 108, 113, and 118 could be included in the
kits. The kits optionally also include instructions for use in carrying out
assays, interpreting results and diagnosing a subject as having a
predisposition toward developing high cholesterol, low HDL and/or
cardiovascular disease.

Exemplary kits for amplifying a region of a COX6B gene, GPI-1
gene, or other genes associated with cardiovascular disease (such as
those listed in Table 3) contain two primers that flank a polymorphic
region of the gene of interest. For example, primers can include the
sequences of SEQ ID NOs.: 3, 4, 8, 9, 41, 42, 46, 47, 51, 52, 56, 57,
61, 62, 66, 67, 71, 72, 76, 77, 81, 82, 86, 87, 91, 92, 96, 97, 101,
102, 106, 107, 111, 112, 116, and 117. For other assays, primers or
probes hybridize to a polymorphic region or 5' or 3' to a polymorphic
region depending on which strand of the target nucleic acid is used. For
example, specific probes and primers contain sequences designated as
SEQ ID NOs.: 5, 10, 43, 48, 53, 58, 63, 68, 73, 78, 83, 88, 93, 98,
103, 108, 113, and 118. Those of skill in the art can synthesize primers
and probes that hybridize adjacent to or at the polymorphic regions
described in Tables 1-3 and other SNPs in genes associated with
cardiovascular disease.

Yet other kits contain at least one reagent necessary to perform an
assay. For example, the kit can comprise an enzyme, such as a nucleic
acid polymerase. Alternatively the kit can contain a buffer or any other
necessary reagent.

Yet other kits contain microarrays of probes to detect allelic
variants of COX6B, GPI-1, and other genes associated with
cardiovascular disease. The kits further contain instructions for their use
and interpreting the results.

The following examples are included for illustrative purposes only
and are not intended to limit the scope of the invention. The practice of
methods and development of the products provided herein employ, unless
otherwise indicated, conventional techniques of cell biology, cell culture,
molecular biology, transgenic biology, microbiology, recombinant DNA,
and immunology, which are within the skill of the art. Such techniques
are explained fully in the literature. See, for example, Molecular Cloning:
A Laboratory Manual, 2nd Ed., ed. by Sambrook, Fritsch and Maniatis
(Cold Spring Harbor Laboratory Press: 1989); DNA Cloning, Volumes I
and II (D.N. Glover ed., 1985); Oligonucleotide Synthesis (M.J. Gait ed.,
1984); Mullis et al. U.S. Patent No. 4,683,195; Nucleic Acid
Hybridization (B.D. Hames & S.J. Higgins eds. 1984); Transcription and
Translation (B.D. Hames & S.J. Higgins eds. 1984); Culture of Animal
Cells (R.I. Freshney, Alan R. Liss, Inc., 1987); Immobilized Cells and
Enzymes (IRL Press, 1986); B. Perbal, A Practical Guide To Molecular
Cloning (1984); the treatise, Methods In Enzymology (Academic Press,
Inc., New York); Gene Transfer Vectors For Mammalian Cells (J.H. Miller
and M.P. Calos eds., 1987, Cold Spring Harbor Laboratory); Methods In
Enzymology, Vols. 154 and 155 (Wu et al. eds.); Immunochemical
Methods In Cell and Molecular Biology (Mayer and Walker, eds.,
Academic Press, London, 1987); Handbook of Experimental Immunology,
Volumes I-IV (D.M. Weir and C.C. Blackwell, eds., 1986); Manipulating
the Mouse Embryo (Cold Spring Harbor Laboratory Press, Cold Spring
Harbor, N.Y., 1986).

EXAMPLE 1 

Isolation of DNA from blood samples of a stratified population 

Blood samples were obtained from a population of unrelated
Caucasian women between the ages of 18 and 79 (average age = 48). The
women had, in response to media campaigns, attended the Twin
Research Unit at the St. Thomas Hospital in London, England. For current
purposes, only one member of a twin pair was used to ensure that all
observations were independent. Blood samples from 1400 unrelated
individuals were measured for levels of cholesterol and HDL.

Cholesterol and HDL levels in blood samples were quantitated using
standard assay methods.

The population was stratified into pools of 200 people, which 
represented the lower extreme and the upper extreme for serum levels of 
cholesterol and HDL. 

Cholesterol

Pool 1: Individuals were considered to have low
cholesterol (0.12 - 3.6 mmoles/L).

Pool 2: Individuals were considered to have high
cholesterol (5.25 - 11.57 mmoles/L).

HDL

Pool 3: Individuals were considered to have low levels
of HDL (0.240 - 1.11 mmoles/L).

Pool 4: Individuals were considered to have high levels
of HDL (2.10 - 3.76 mmoles/L).
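
By way of illustration only, the pool ranges listed above can be expressed
as a simple lookup. The sketch below is not part of the described method:
the function name `assign_pools` and the use of inclusive range boundaries
are assumptions made for the example, and the actual pools were limited to
200 individuals each at the extremes of the distribution.

```python
# Pool boundaries in mmoles/L, copied from the ranges listed above.
POOLS = {
    1: ("cholesterol", 0.12, 3.6),    # low cholesterol
    2: ("cholesterol", 5.25, 11.57),  # high cholesterol
    3: ("hdl", 0.240, 1.11),          # low HDL
    4: ("hdl", 2.10, 3.76),           # high HDL
}


def assign_pools(cholesterol, hdl):
    """Return the sorted pool numbers whose range contains the measurements.

    Inclusive boundaries are an assumption of this sketch.
    """
    values = {"cholesterol": cholesterol, "hdl": hdl}
    return sorted(
        pool
        for pool, (analyte, low, high) in POOLS.items()
        if low <= values[analyte] <= high
    )


if __name__ == "__main__":
    print(assign_pools(3.0, 1.5))  # falls only in the low cholesterol range
    print(assign_pools(6.0, 1.0))  # high cholesterol and low HDL ranges
```

An individual whose values fall between the listed extremes is assigned to
no pool, which mirrors the use of only the lower and upper extremes.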

DNA extraction protocol 

DNA was extracted from blood samples of each of the pools by 
utilizing the following protocol. 
Section 1

1. Blood was extracted into EDTA tubes.

2. Blood sample was spun at 3,000 rpm for 10 minutes in a 
clinical centrifuge. 

3. The buffy coat (the leucocytes, a yellowish layer of cells on 
top of the red blood cells) was removed and pooled into a 1 

15 ml conical tube. 

4. 0.9% saline was added to fill the tube and resuspend the
leucocytes. Samples were immediately further processed or
stored at 4°C for 24 hrs.

5. The sample was spun at 2,500 rpm for 10 minutes. 

6. The buffy coat was again removed as cleanly as possible,
leaving behind any red cells; the sample was suspended in
red cell lysis buffer and left for 20 minutes at 4°C.

7. The sample was spun again at 2,500 rpm for 10 minutes. If
a pellet of unlysed red cells remained lying above the
leucocytes, the treatment with red cell lysis buffer was
repeated.

8. The leucocyte pellet was resuspended in 2 ml 0.9% saline. 

9. The DNA was liberated by the addition of leucocyte lysis
buffer - the tube was capped and gently inverted several
times, until the liquid became viscous with DNA. The
samples were handled with care to avoid shearing and
damage to the DNA.

10. Samples were frozen for storage prior to full extraction.

Section 2

11. 2 ml of 5 M sodium perchlorate was added to the thawed
sample and mixed by inversion. The sample was heated to
60°C for 30 - 40 minutes to fully denature proteins.

12. An equal volume of chloroform/isoamyl alcohol (24:1) was
added at room temperature and the sample mixed for 10
minutes.

13. The sample was spun without a brake at 3,000 rpm for 10
minutes.

14. The top aqueous phase was removed into a clean tube and
two volumes of cold 100% ethanol added and mixed by
inversion to precipitate DNA.

15. The DNA was removed using a sterile loop and resuspended
in 1-5 ml TE buffer depending on the DNA yield.

16. The optical density was measured at 260 and 280 nm to
check yield and purity of the DNA sample. For use in
Examples 2 and 3, all DNA had an absorbance ratio of 1.6 at
260/280, a total yield of 32 µg and a concentration of 10
ng/µl. If initial purity levels were unacceptable a
re-extraction was carried out (steps 12-15 above).
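
The figures in step 16 can be related to one another by simple arithmetic.
The following sketch is illustrative only; the function names are
hypothetical, and the conversion of 50 ng/µl of double-stranded DNA per
unit of absorbance at 260 nm (1 cm path length) is the conventional
approximation, assumed here because the source does not state how
concentration was derived.

```python
def purity_ratio(a260, a280):
    """Return the 260/280 absorbance ratio used as a purity check."""
    return a260 / a280


def dsdna_conc_ng_per_ul(a260, dilution_factor=1.0):
    """Estimate dsDNA concentration, assuming A260 of 1.0 equals 50 ng/ul."""
    return a260 * 50.0 * dilution_factor


def resuspension_volume_ml(yield_ug, conc_ng_per_ul):
    """Volume in ml that holds yield_ug of DNA at conc_ng_per_ul."""
    volume_ul = yield_ug * 1000.0 / conc_ng_per_ul  # ug to ng, then ng / (ng/ul)
    return volume_ul / 1000.0


if __name__ == "__main__":
    # 32 ug at 10 ng/ul corresponds to 3.2 ml, inside the 1-5 ml TE range.
    print(resuspension_volume_ml(32, 10))
```

A yield of 32 µg at 10 ng/µl implies a volume of about 3.2 ml, which is
consistent with the 1-5 ml of TE buffer given in step 15.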
EXAMPLE 2

Detection of an Association Between an SNP at Position 86 of the Human 
COX6B Gene and High Cholesterol 

DNA samples (as prepared in Example 1), representing 200
women, from the lower extreme, pool 1 (low levels of cholesterol) and
the upper extreme, pool 2 (high levels of cholesterol) were amplified and
analyzed for genetic differences using a MassEXTEND™ assay detection
method. For each pool, single nucleotide polymorphisms were examined
throughout the entire genome to detect differences in allelic frequency of
a variant allele between the pools.

PCR Amplification of Samples from Pools 1 and 2 

PCR primers were synthesized by Operon (Alameda, CA) using
phosphoramidite chemistry. Amplification of the COX6B target sequence
was carried out in two 50 µl PCR reactions with 100 ng of pooled human
genomic DNA, obtained as described in Example 1, taken from samples in
pool 1 or pool 2, although amounts ranging from 100 ng to 1 µg could be
used. Individual DNAs were present in the pooled samples in equal
amounts, with a final amount of 0.5 ng each. Each
reaction contained 1X PCR buffer (Qiagen, Valencia, CA), 200 µM dNTPs,
1 U Hotstar Taq polymerase (Qiagen, Valencia, CA), 4 mM MgCl2, and 25
pmoles of the long primer containing both the universal primer sequence
and the target specific sequence
5'-AGCGGATAACAATTTCACACAGGTAGTCTGGTTCTGGTTGGGG-3'
(SEQ ID NO.: 4), 2 pmoles of the short primer
5'-AGGATTCAGCACCATGGC-3' (SEQ ID NO.: 3) and 10 pmoles of a
biotinylated universal primer complementary to the 5' end of the PCR
amplicon 5'-AGCGGATAACAATTTCACACAGG-3' (SEQ ID NO.: 121).
Alternatively, the biotinylated universal primer could be
5'-GGCGCACGCCTCCACG-3' (SEQ ID NO.: 122). After an initial round of
amplification of the target with the specific forward (long) and reverse
(short) primers, the 5' biotinylated universal primer then hybridized and
acted as a reverse primer thereby introducing a 3' biotin capture moiety
into the molecule. The amplification protocol results in a 5'-biotinylated
double stranded DNA amplicon and dramatically reduces the cost of high
throughput genotyping by eliminating the need to 5' biotin label each
forward primer used in genotyping. Thermal cycling was performed in
0.2 mL tubes or a 96 well plate using an MJ Research Thermal Cycler
(Waltham, MA) (calculated temperature) with the following cycling
parameters: 94°C for 5 min; 45 cycles of 94°C for 20 sec, 56°C for 30
sec, 72°C for 60 sec; 72°C for 3 min.
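
As an illustrative check only, the relationship between the long primer
and the universal primer sequence can be verified directly from the
sequences given above. The function name `split_tagged_primer` is
hypothetical and is not part of the described method.

```python
UNIVERSAL_TAG = "AGCGGATAACAATTTCACACAGG"  # SEQ ID NO.: 121
LONG_PRIMER = "AGCGGATAACAATTTCACACAGGTAGTCTGGTTCTGGTTGGGG"  # SEQ ID NO.: 4


def split_tagged_primer(primer, tag=UNIVERSAL_TAG):
    """Split a tagged primer into (universal tag, target specific part).

    Raises ValueError when the primer does not begin with the tag.
    """
    if not primer.startswith(tag):
        raise ValueError("primer does not start with the universal tag")
    return tag, primer[len(tag):]


if __name__ == "__main__":
    tag, target_specific = split_tagged_primer(LONG_PRIMER)
    print(len(tag), len(target_specific), target_specific)
```

The same check applied to an untagged primer, such as SEQ ID NO.: 3,
raises an error, which distinguishes the short primer from the long one.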
Immobilization of DNA 

The 50 µl PCR reaction was added to 25 µl of streptavidin coated
magnetic beads (Dynal, Lake Success, NY) prewashed three times and
resuspended in 1 M NH4Cl, 0.06 M NH4OH. The PCR amplicons were
allowed to bind to the beads for 15 minutes at room temperature. The
beads were then collected with a magnet and the supernatant containing
unbound DNA was removed. The unbound strand was released from the
double stranded amplicons by incubation in 100 mM NaOH and washing
of the beads three times with 10 mM Tris pH 8.0.
Genotyping 

The frequency of the alleles at position 86 in the COX6B gene was
measured using the MassEXTEND™ assay and MALDI-TOF. The SNP
identified at position 86 of COX6B in the GenBank sequence is
represented as a G to T transversion. The MassEXTEND™ assay used
detected the sequence of the complementary strand, thus the SNP was
represented as G to A in the primer extension products. The DNA coated
magnetic beads were resuspended in 26 mM Tris-HCl pH 9.5, 6.5 mM
MgCl2, 50 mM dTTP and 50 mM each of ddCTP, ddATP and
ddGTP, 2.5 U of a thermostable DNA polymerase (Amersham Pharmacia
Biotech, Piscataway, NJ) and 20 pmoles of a template specific
oligonucleotide primer 5'-AATCAAGAACTACAAGAC-3' (SEQ ID NO.: 5)
(Operon, Alameda, CA). Primer extension occurred with three cycles of
oligonucleotide primer hybridization and extension. The extension
products were analyzed after denaturation from the template with 50 mM
NH4Cl and transfer of 150 nl of each sample to a silicon chip preloaded
with 150 nl of H3PA (3-hydroxy picolinic acid) (Sigma Aldrich, St Louis,
MO) matrix material. The sample material was allowed to crystallize and
analyzed by MALDI-TOF (Bruker Daltonics, Billerica, MA; PerSeptive,
Foster City, CA). The mass of the primer used in the MassEXTEND™
reaction was 5493.70 daltons. The predominant allele is extended by the
addition of ddC, giving an extension product having a mass of 5766.90
daltons. The allelic variant results in the addition of dT and ddG to the
primer to produce an extension product having a mass of 6111.10 daltons.
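
The extension product masses stated above follow from adding the mass of
each incorporated nucleotide to the primer mass. The sketch below is
illustrative only; the per-nucleotide mass increments are commonly used
average values and are an assumption of the example rather than figures
taken from this description, although they reproduce the stated masses.

```python
# Average mass added per incorporated nucleotide, in daltons.
# These increments are assumed standard values, not stated in the text.
ADDED_MASS = {"ddC": 273.2, "ddA": 297.2, "ddG": 313.2, "dT": 304.2}


def extension_mass(primer_mass, additions):
    """Return the mass of a primer after the listed nucleotides are added."""
    return round(primer_mass + sum(ADDED_MASS[n] for n in additions), 2)


if __name__ == "__main__":
    primer = 5493.70  # mass of the COX6B extension primer given above
    print(extension_mass(primer, ["ddC"]))        # predominant allele
    print(extension_mass(primer, ["dT", "ddG"]))  # allelic variant
```

The two products differ by 344.2 daltons, which is the separation the
mass spectrometer must resolve to distinguish the alleles.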

In addition to being analyzed as part of a pool, each individual
sample (0.5 ng) was amplified as described above and analyzed
individually using a MassEXTEND™ reaction as described above.

Pooled populations of women (200 women per pool) with high
cholesterol (pool 2) showed an increase in the frequency of the A allele at
nucleotide position 86 of COX6B as compared with those with low levels
of cholesterol (pool 1) (see Fig. 1). The association of this allelic variant
of the COX6B gene with high cholesterol gave a statistically significant
value of 14.30 using a 1-degree-of-freedom chi-squared test of
association. In other words, the increase from 2.75% to 9.05% is
significant, with a p value of 0.000156 (see Fig. 1). The genotype of
each of the individuals in the pooled population was also determined by
carrying out MassEXTEND™ reactions on each DNA sample individually.
These analyses confirmed the pooling data, showing that there was an
increase in the frequency of the A allele from 2.27% to 9.93%
(p = 0.0000061). The genotypes in pool 2 showed a decrease in the
homozygous GG genotype from 95.4% to 82.35% and an increase in the
heterozygous GA genotype from 4.55% to 15.44%. None of the
individuals with low levels of serum cholesterol exhibited the homozygous
AA genotype.
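
The reported statistics can be cross-checked with a short calculation,
offered here as an illustrative sketch only. The function names are
hypothetical. The homozygous AA percentage for pool 2 is not stated above;
in the usage below it is inferred as the remainder after the GG and GA
percentages, which is an assumption of the example.

```python
import math


def chi2_p_value_1df(statistic):
    """Upper-tail p value of a chi-squared statistic with 1 degree of freedom.

    For 1 degree of freedom the statistic is the square of a standard
    normal variable, so the tail probability is erfc(sqrt(x / 2)).
    """
    return math.erfc(math.sqrt(statistic / 2.0))


def variant_allele_frequency(het_pct, hom_variant_pct):
    """Variant allele frequency (%) from genotype percentages.

    Heterozygotes carry one variant allele of two; homozygotes carry two.
    """
    return 0.5 * het_pct + hom_variant_pct


if __name__ == "__main__":
    print(chi2_p_value_1df(14.30))  # close to the reported 0.000156
    print(variant_allele_frequency(4.55, 0.0))  # pool 1
    aa_pool2 = 100.0 - 82.35 - 15.44  # inferred remainder, not stated above
    print(variant_allele_frequency(15.44, aa_pool2))  # pool 2
```

Under that assumption the genotype percentages give A allele frequencies
of about 2.27% for pool 1 and 9.93% for pool 2, in agreement with the
individually genotyped frequencies reported above.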

EXAMPLE 3 

Detection of an Association Between an SNP at Position 2577 of the
Human GPI-1 Gene and Low HDL

DNA samples (as prepared in Example 1), representing 200
women, from pool 3 (low levels of HDL) and pool 4 (high levels of HDL)
were amplified and analyzed for genetic differences using a
MassEXTEND™ detection method. For each pool, SNPs were examined
throughout the genome to detect differences in allelic frequency of variant
alleles between the pools.

PCR Amplification of Samples from Pools 3 and 4

PCR primers were synthesized by Operon (Alameda, CA) using
phosphoramidite chemistry. Amplification of the GPI-1 target sequence
was carried out in a single 50 µl PCR reaction with 100 ng of pooled human
genomic DNA (200 samples), obtained as described in Example 1, taken
from samples in pool 3 or pool 4, although amounts ranging from 100 ng
to 1 µg could be used. Individual DNAs were present in the pooled
samples in equal amounts, with a final amount
of 0.5 ng each. Each reaction contained 1X PCR buffer (Qiagen, Valencia,
CA), 200 µM dNTPs, 1 U Hotstar Taq polymerase (Qiagen, Valencia, CA),
4 mM MgCl2, and 25 pmoles of the forward primer containing both the
universal primer sequence and the target specific short sequence
5'-AGCAGGGCTTCCTCCTTC-3' (SEQ ID NO.: 8), 2 pmoles of the long
primer 5'-AGCGGATAACAATTTCACACAGGTGACCCAGCCGTACCTATTC-3'
(SEQ ID NO.: 9) and 10 pmoles of a biotinylated universal primer
complementary to the 5' end of the PCR amplicon
5'-AGCGGATAACAATTTCACACAGG-3' (SEQ ID NO.: 121). After an
initial round of amplification of the target with the specific forward
(long) and reverse (short) primers, the 5' biotinylated universal primer then
hybridized and acted as a reverse primer thereby introducing a 3' biotin
capture moiety into the molecule. The amplification protocol results in a
5'-biotinylated double stranded DNA amplicon and dramatically reduces
the cost of high throughput genotyping by eliminating the need to 5'
biotin label each forward primer used in genotyping. Thermal cycling
was performed in 0.2 mL tubes or a 96 well plate using an MJ Research
Thermal Cycler (Waltham, MA) (calculated temperature) with the following
cycling parameters: 94°C for 5 min; 45 cycles of 94°C for 20 sec, 56°C
for 30 sec, 72°C for 60 sec; 72°C for 3 min.
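
For planning purposes, the programmed hold time of the cycling
parameters above can be totalled as in the following illustrative sketch.
The function name is hypothetical, and the result excludes temperature
ramp time between steps, which depends on the instrument and is not
given here.

```python
def program_seconds(initial_s, cycles, step_seconds, final_s):
    """Total programmed hold time in seconds, excluding ramp time."""
    return initial_s + cycles * sum(step_seconds) + final_s


if __name__ == "__main__":
    # 94 C for 5 min; 45 cycles of 20 s, 30 s and 60 s; 72 C for 3 min
    total = program_seconds(5 * 60, 45, [20, 30, 60], 3 * 60)
    print(total, "seconds, or", total / 60, "minutes")
```

The programmed holds amount to 5430 seconds, or 90.5 minutes, per run.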
Immobilization of DNA 

The 50 µl PCR reaction was added to 25 µl of streptavidin coated
magnetic beads (Dynal, Lake Success, NY) prewashed three times and
resuspended in 1 M NH4Cl, 0.06 M NH4OH. The PCR amplicons were
allowed to bind to the beads for 15 minutes at room temperature. The
beads were then collected with a magnet and the supernatant containing
unbound DNA was removed. The unbound strand was released from the
double stranded amplicons by incubation in 100 mM NaOH and washing
of the beads three times with 10 mM Tris pH 8.0.

Genotyping

The frequency of the alleles at position 2577 in the GPI-1 gene was
measured using the MassEXTEND™ assay and MALDI-TOF. The SNP
identified at position 2577 of GPI-1 in the GenBank sequence is
represented as a G to A transversion. The MassEXTEND™ assay used
detected the complement of this sequence, thus the SNP was represented
as C to T in the primer extension products. The DNA coated magnetic
beads were resuspended in 26 mM Tris-HCl pH 9.5, 6.5 mM MgCl2,
50 mM dTTP and 50 mM each of ddCTP, ddATP and ddGTP, 2.5 U of a
thermostable DNA polymerase (Amersham Pharmacia Biotech,
Piscataway, NJ) and 20 pmoles of a template specific oligonucleotide
primer 5'-AAGGGAGACAGATTTGGC-3' (SEQ ID NO.: 10) (Operon,
Alameda, CA). Primer extension occurred with three cycles of
oligonucleotide primer hybridization and extension. The extension
products were analyzed after denaturation from the template with 50 mM
NH4Cl and transfer of 150 nl of each sample to a silicon chip preloaded with
150 nl of H3PA matrix material. The sample material was allowed to
crystallize and analyzed by MALDI-TOF (Bruker Daltonics, Billerica, MA;
PerSeptive, Foster City, CA). The mass of the primer used in the
MassEXTEND™ reaction was 5612.70 daltons. The predominant allele is
extended by the addition of ddC to give a product having a mass of
5885.90 daltons. The allelic variant results in the addition of dT and ddG
to the primer to produce an extension product having a mass of
6230.10 daltons.
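
The expected masses quoted above follow from adding nucleotide residue masses to the mass of the unextended primer. The Python sketch below is illustrative only: the residue masses (273.2 daltons for ddC, 304.2 daltons for dT and 313.2 daltons for ddG) are rounded average values assumed here, not figures stated in this example, and the function name is hypothetical.

```python
# Mass of the unextended MassEXTEND primer, as stated in the text (daltons).
PRIMER_MASS = 5612.70

# Assumed average masses (daltons) added per incorporated nucleotide residue.
# These rounded values are assumptions for illustration, not taken from the text.
RESIDUE_MASS = {"ddC": 273.2, "dT": 304.2, "ddG": 313.2}


def extension_product_mass(primer_mass, added_residues):
    """Return the primer mass plus the masses of the residues added to it."""
    return round(primer_mass + sum(RESIDUE_MASS[r] for r in added_residues), 2)


# Predominant allele: the primer is terminated by a single ddC.
predominant = extension_product_mass(PRIMER_MASS, ["ddC"])
# Allelic variant: the primer incorporates dT and is then terminated by ddG.
variant = extension_product_mass(PRIMER_MASS, ["dT", "ddG"])
print(predominant, variant)
```

Under these assumed residue masses the sketch reproduces the 5885.90 and 6230.10 dalton values given above, a separation of several hundred daltons between the two extension products.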

In addition to being analyzed as a pool, each individual sample (0.5
ng) was amplified as described above and analyzed individually using the
MassEXTEND™ reaction as described above.

Pooled populations of women (200 women per pool) with low HDL
(pool 3) showed an increase in the T allele of 11.33% at nucleotide
position 2577 as compared with those with high levels of HDL (pool 4).
The association of this allelic variant of the GPI-1 gene with low HDL
gave a statistically significant value of 15.04 using a 1-degree-of-freedom
chi-squared test of association. In other words, the increase from 16.23%
to 27.57% is significant, with a p value of 0.0001064 (see Fig. 2). The
genotype of each of the individuals in the pooled population was also
determined by carrying out individual MassEXTEND™ reactions on
individual DNA samples. These analyses confirmed the pooling data,
showing that there was an increase in the frequency of the T allele from
19.49% to 26.1% (p = 0.024). The measured genotypes in pool 3
showed a decrease in the homozygous CC genotype from 65.24% to
54.21% and an increase in the heterozygous CT genotype from 30.51%
to 39.25%. The homozygous TT genotypes increased 2.3%.
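
The statistics reported above can be approximately reproduced from the stated frequencies. The Python sketch below is illustrative and rests on assumptions not stated in the text: that each pool of 200 women contributes 400 chromosomes, that the test is a Pearson chi-squared test on a 2 x 2 table of allele counts without continuity correction, and that the TT genotype frequency is the remainder after CC and CT. All function names are hypothetical.

```python
import math


def allele_chi_square(freq_a, freq_b, chromosomes_per_pool):
    """Pearson chi-squared statistic (1 degree of freedom) and p value for a
    2 x 2 table of allele counts built from two pooled allele frequencies."""
    n = chromosomes_per_pool
    observed = [
        [freq_a * n, (1.0 - freq_a) * n],  # pool A: T count, C count
        [freq_b * n, (1.0 - freq_b) * n],  # pool B: T count, C count
    ]
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    total = sum(row_totals)
    chi2 = 0.0
    for i in range(2):
        for j in range(2):
            expected = row_totals[i] * col_totals[j] / total
            chi2 += (observed[i][j] - expected) ** 2 / expected
    # Upper tail of the chi-squared distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


def t_allele_frequency(cc_percent, ct_percent):
    """T allele frequency (%) from CC and CT genotype percentages,
    taking TT as the remainder (an assumption of this sketch)."""
    tt_percent = 100.0 - cc_percent - ct_percent
    return ct_percent / 2.0 + tt_percent


# Pooled data: T allele frequency in pool 3 (low HDL) and pool 4 (high HDL).
chi2, p_value = allele_chi_square(0.2757, 0.1623, 400)

# Individual genotypes: high HDL group, then low HDL group.
t_high_hdl = t_allele_frequency(65.24, 30.51)
t_low_hdl = t_allele_frequency(54.21, 39.25)
print(round(chi2, 2), p_value, round(t_high_hdl, 2), round(t_low_hdl, 2))
```

With these inputs the chi-squared statistic comes out close to the reported 15.04, the p value is on the order of 1 x 10^-4, and the T allele frequencies derived from the genotype percentages agree with the reported 19.49% and 26.1% to within rounding.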

Since modifications will be apparent to those of skill in this art, it is
intended that this invention be limited only by the scope of the appended
claims.

WHAT IS CLAIMED:

1. A method for detecting the presence or absence in a subject
of at least one allelic variant of a polymorphic region of a gene associated
with cardiovascular disease, comprising:

the step of detecting the presence or absence of an allelic variant
of a polymorphic region of a cytochrome C oxidase subunit VIb (COX6B)
gene of the subject that is associated with high serum cholesterol or an
allelic variant of a polymorphic region of a N-acetylglucosaminyl
transferase component (GPI-1) gene of the subject that is associated with
low serum high density lipoprotein (HDL).

2. The method of claim 1, wherein the allelic variant is of a
polymorphic region of the cytochrome C oxidase subunit VIb (COX6B)
gene.

3. The method of claim 1, wherein the allelic variant is of a
polymorphic region of the N-acetylglucosaminyl transferase component
(GPI-1) gene.

4. The method of any of claims 1-3, further comprising
detecting the presence or absence in a subject of at least one allelic variant
of another gene associated with cardiovascular disease.

5. The method of claim 4, wherein the other gene is selected from
the group consisting of cholesterol ester transfer protein, plasma (CETP);
apolipoprotein A-IV (APO A4); apolipoprotein A-I (APO A1); apolipoprotein
E (APO E); apolipoprotein B (APO B); apolipoprotein C-III (APO C3); a
gene encoding lipoprotein lipase (LPL); ATP-binding cassette transporter
(ABC 1); paraoxonase 1 (PON 1); paraoxonase 2 (PON 2);
5,10-methylenetetrahydrofolate reductase (MTHFR); a gene encoding hepatic
lipase, E-selectin, G protein beta 3 subunit and angiotensin II type 1
receptor gene.

6. The method of claim 2 or claim 3, wherein the polymorphic
region is a single nucleotide polymorphism (SNP).

7. The method of any of claims 1-6, wherein the detection is
effected by detecting a light producing reagent.

8. The method of claim 6, wherein the SNP is at position 86 of
the cytochrome C oxidase subunit VIb (COX6B) gene coding sequence
and the allelic variant is represented by a T nucleotide in the sense strand
or an A nucleotide in the corresponding position in the antisense strand.

9. The method of claim 6, wherein the SNP is at position 2577
of the N-acetylglucosaminyl transferase component GPI-1 (GPI-1) gene
sequence and the allelic variant is represented by an A nucleotide in the
sense strand or a T nucleotide in the corresponding position in the
antisense strand.

10. The method of any of claims 1-3, wherein the detecting step
is by a method selected from the group consisting of allele specific
hybridization, primer specific extension, oligonucleotide ligation assay,
restriction enzyme site analysis and single-stranded conformation
polymorphism analysis.

11. The method of claim 8, further comprising:

(a) hybridizing a target nucleic acid comprising a
cytochrome C oxidase subunit VIb (COX6B)-encoding nucleic acid
or fragment thereof with a nucleic acid primer that hybridizes
adjacent to nucleotide 86 of the coding sequence of the COX6B
gene;

(b) extending the nucleic acid primer using the target
nucleic acid as a template; and

(c) determining the mass of the extended primer to
identify the nucleotide present at position 86, thereby determining
the presence or absence of the allelic variant.

12. The method of claim 9, further comprising:

(a) hybridizing a target nucleic acid comprising a
N-acetylglucosaminyl transferase component GPI-1 (GPI-1)-encoding
nucleic acid or fragment thereof with a nucleic acid primer that
hybridizes adjacent to nucleotide 2577 of the GPI-1 gene;

(b) extending the nucleic acid primer using the target
nucleic acid as a template; and

(c) determining the mass of the extended primer to
identify the nucleotide present at position 2577, thereby
determining the presence or absence of the allelic variant.

13. The method of any of claims 1-12, wherein the detecting
step comprises mass spectrometry.

14. The method of any of claims 1-6 and 8-12, wherein the
detection is effected by detecting a signal moiety selected from the group
consisting of radioisotopes, enzymes, antigens, antibodies,
spectrophotometric reagents, chemiluminescent reagents and fluorescent
reagents.

15. The method of claim 11 or claim 12, wherein the nucleic acid
primer is extended in the presence of at least one dideoxynucleotide.

16. The method of claim 15 or claim 16, wherein the
dideoxynucleotide is dideoxyguanosine (ddG).

17. The method of claim 11, wherein the primer is extended in
the presence of at least two dideoxynucleotides and the dideoxynucleotides
are dideoxyguanosine (ddG) and dideoxycytosine (ddC).

18. The method of claim 12, wherein the primer is extended in
the presence of at least two dideoxynucleotides and the
dideoxynucleotides are dideoxyguanosine (ddG) and dideoxycytosine (ddC).

19. A method for indicating a predisposition to cardiovascular
disease in a subject, comprising:

the step of detecting in a target nucleic acid obtained from the
subject the presence or absence of at least one allelic variant of
polymorphic regions of a cytochrome C oxidase subunit VIb (COX6B)
gene associated with high serum cholesterol or at least one allelic variant
of polymorphic regions of a N-acetylglucosaminyl transferase component
GPI-1 (GPI-1) gene associated with low serum HDL, wherein the presence
of an allelic variant is indicative of a predisposition to cardiovascular
disease compared to a subject who does not comprise the allelic variant.

20. The method of claim 19, wherein the allelic variant is of a
polymorphic region of the cytochrome C oxidase subunit VIb (COX6B)
gene.

21. The method of claim 19, wherein the allelic variant is of a
polymorphic region of the N-acetylglucosaminyl transferase component
GPI-1 (GPI-1) gene.

22. The method of claim 20 or claim 21, wherein the
polymorphic region is a single nucleotide polymorphism (SNP).

23. The method of claim 22, wherein the SNP is at position 86
of the cytochrome C oxidase subunit VIb (COX6B) gene coding sequence
and the allelic variant is represented by a T nucleotide in the sense strand
or an A nucleotide in the corresponding position in the antisense strand.

24. The method of claim 22, wherein the SNP is at position
2577 of the N-acetylglucosaminyl transferase component GPI-1 (GPI-1)
gene sequence and the allelic variant is represented by an A nucleotide in
the sense strand or a T nucleotide in the corresponding position in the
antisense strand.

25. The method of claim 19, wherein the detecting step is by a
method selected from the group consisting of allele specific hybridization,
primer specific extension, oligonucleotide ligation assay, restriction
enzyme site analysis and single-stranded conformation polymorphism
analysis.

26. The method of claim 23, further comprising:

(a) hybridizing a target nucleic acid comprising a cytochrome C
oxidase subunit VIb (COX6B)-encoding nucleic acid or fragment thereof
with a nucleic acid primer that hybridizes adjacent to nucleotide 86 of the
coding sequence of the COX6B gene;

(b) extending the nucleic acid primer using the target nucleic
acid as a template; and

(c) determining the mass of the extended primer to identify the
nucleotide present at position 86, thereby determining the presence or
absence of the allelic variant.

27. The method of claim 24, further comprising:

(a) hybridizing a target nucleic acid comprising a
N-acetylglucosaminyl transferase component GPI-1 (GPI-1)-encoding nucleic
acid or fragment thereof with a nucleic acid primer that hybridizes
adjacent to nucleotide 2577 of the GPI-1 gene;

(b) extending the nucleic acid primer using the target nucleic
acid as a template; and

(c) determining the mass of the extended primer to identify the
nucleotide present at position 2577, thereby determining the presence or
absence of the allelic variant.

28. The method of claim 19, wherein the detecting step
comprises mass spectrometry.

29. The method of claim 19, wherein the detection is effected by
detecting a signal moiety selected from the group consisting of:
radioisotopes, enzymes, antigens, antibodies, spectrophotometric
reagents, chemiluminescent reagents, fluorescent reagents and other light
producing reagents.

30. The method of claim 19, further comprising detecting the
presence or absence of at least one allelic variant of polymorphic regions
of another gene associated with cardiovascular disease, wherein the
presence of the two allelic variants is associated with a predisposition to
cardiovascular disease compared to a subject who does not comprise the
combination of allelic variants.

WO 02/072604    PCT/US02/06728

31. The method of claim 30, wherein the other gene is selected
from the group consisting of cholesterol ester transfer protein, plasma
(CETP); apolipoprotein A-IV (APO A4); apolipoprotein A-I (APO A1);
apolipoprotein E (APO E); apolipoprotein B (APO B); apolipoprotein C-III
(APO C3); a gene encoding lipoprotein lipase (LPL); ATP-binding cassette
transporter (ABC 1); paraoxonase 1 (PON 1); paraoxonase 2 (PON 2);
5,10-methylenetetrahydrofolate reductase (MTHFR); a gene encoding
hepatic lipase, E-selectin, G protein beta 3 subunit and angiotensin II type
1 receptor gene.

32. The method of claim 30, wherein the two allelic variants are of
the cytochrome C oxidase subunit VIb (COX6B) gene and the
N-acetylglucosaminyl transferase component GPI-1 (GPI-1) gene.

33. A kit comprising:

(a) at least one probe specific for a polymorphic region of
a human gene selected from the group consisting of cytochrome C
oxidase subunit VIb (COX6B); N-acetylglucosaminyl transferase
component GPI-1 (GPI-1); cholesterol ester transfer protein, plasma
(CETP); apolipoprotein A-IV (APO A4); apolipoprotein A-I (APO A1);
apolipoprotein E (APO E); apolipoprotein B (APO B); apolipoprotein
C-III (APO C3); a gene encoding lipoprotein lipase (LPL);
ATP-binding cassette transporter (ABC 1); paraoxonase 1 (PON 1);
paraoxonase 2 (PON 2); 5,10-methylenetetrahydrofolate
reductase (MTHFR); a gene encoding hepatic lipase, E-selectin, G
protein beta 3 subunit and angiotensin II type 1 receptor gene; and

(b) instructions for use.

34. A method of screening for biologically active agents that
modulate serum cholesterol, comprising:

(a) combining a candidate agent with a cell comprising a
nucleotide sequence encoding an allelic variant of a cytochrome C
oxidase subunit VIb (COX6B) gene associated with high levels of
serum cholesterol and operably linked to a promoter such that the
nucleotide sequence is expressed as a COX6B protein in the cell;
and

(b) determining the effect of the agent upon the
expression and/or activity of the COX6B protein.

35. A method of screening for biologically active agents that
modulate serum cholesterol, comprising:

(a) combining a candidate agent with a transgenic mouse
comprising a transgenic nucleotide sequence stably integrated into
the genome of the mouse, wherein the transgenic nucleotide
sequence encodes an allelic variant of a cytochrome C oxidase
subunit VIb (COX6B) gene associated with high levels of serum
cholesterol and operably linked to a promoter, wherein the
transgenic nucleotide sequence is expressed and the transgenic
animal develops a high level of serum cholesterol; and

(b) determining the effect of the agent upon the serum
cholesterol level.

36. The method of claim 34 or claim 37, wherein the allelic
variant is at position 86 of the cytochrome C oxidase subunit VIb
(COX6B) gene.

37. A method of screening for biologically active agents that
modulate serum high density lipoprotein (HDL), comprising:

(a) combining a candidate agent with a cell comprising a
nucleotide sequence encoding an allelic variant of a
N-acetylglucosaminyl transferase component GPI-1 (GPI-1) gene
associated with low levels of serum HDL and operably linked to a
promoter such that the nucleotide sequence is expressed as a
GPI-1 protein in the cell; and

(b) determining the effect of the agent upon the
expression and/or activity of the GPI-1 protein.

38. A method of screening for biologically active agents that
modulate serum high density lipoprotein (HDL), comprising:

(a) combining a candidate agent with a transgenic mouse
comprising a transgenic nucleotide sequence stably integrated into
the genome of the mouse encoding an allelic variant of a
N-acetylglucosaminyl transferase component GPI-1 (GPI-1) gene
associated with low levels of serum HDL operably linked to a
promoter, wherein the transgenic nucleotide sequence is expressed
and the transgenic animal develops a low level of serum HDL; and

(b) determining the effect of the agent upon the serum
HDL level.

39. The method of claim 37 or claim 38, wherein the allelic
variant is at position 2577 of the N-acetylglucosaminyl transferase
component GPI-1 (GPI-1) gene.

40. A method for predicting a response of a subject to a
cardiovascular drug, comprising:

detecting the presence or absence of at least one allelic variant of a
cytochrome C oxidase subunit VIb (COX6B) gene of the subject
associated with high serum cholesterol or at least one allelic variant of a
N-acetylglucosaminyl transferase component GPI-1 (GPI-1) gene of the
subject associated with low serum high density lipoprotein (HDL);

wherein the presence of at least one allelic variant is indicative of a
positive response.

41. The method of claim 40, wherein the allelic variant is of the
cytochrome C oxidase subunit VIb (COX6B) gene.

42. The method of claim 40, wherein the allelic variant is of the
N-acetylglucosaminyl transferase component GPI-1 (GPI-1) gene.

43. A method for predicting a response of a subject to a
cardiovascular drug, comprising:

detecting the presence or absence of at least one allelic variant of a
cytochrome C oxidase subunit VIb (COX6B) gene of the subject
associated with high serum cholesterol; and

detecting the presence or absence of at least one allelic variant
of a N-acetylglucosaminyl transferase component GPI-1 (GPI-1) gene of
the subject associated with low serum high density lipoprotein (HDL);

wherein the presence of at least one allelic variant of the COX6B
and at least one allelic variant of the GPI-1 gene is indicative of a positive
response.

44. A method for predicting a response of a subject to a
biologically active agent that modulates serum cholesterol, comprising:

detecting the presence or absence of at least one allelic variant of a
cytochrome C oxidase subunit VIb (COX6B) gene of the subject
associated with high cholesterol;

wherein the presence of at least one allelic variant is indicative of a
positive response.

45. A method for predicting a response of a subject to a
biologically active agent that modulates serum cholesterol, comprising:

detecting the presence or absence of at least one allelic variant of a
cytochrome C oxidase subunit VIb (COX6B) gene of the subject
associated with high cholesterol; and

detecting the presence or absence of an allelic variant of at least
one other gene of the subject associated with cardiovascular disease,
wherein the presence of both allelic variants is indicative of a positive
response.

46. The method of claim 44 or claim 45, wherein the allelic
variant of the cytochrome C oxidase subunit VIb (COX6B) gene is at
position 86.

47. A method for predicting a response of a subject to a
biologically active agent that modulates serum high density lipoprotein
(HDL), comprising:

detecting the presence or absence of at least one allelic variant of a
N-acetylglucosaminyl transferase component GPI-1 (GPI-1) gene of the
subject associated with low HDL; wherein the presence of an allelic
variant is indicative of a positive response.

48. A method for predicting a response of a subject to a
biologically active agent that modulates serum high density lipoprotein
(HDL) levels, comprising:

(a) detecting the presence or absence of at least one
allelic variant of a N-acetylglucosaminyl transferase component
GPI-1 (GPI-1) gene associated with low HDL of the subject; and

(b) detecting the presence or absence of an allelic variant
in at least one other gene of the subject associated with cardiovascular
disease, wherein the presence of both allelic variants is indicative
of a positive response.

49. The method of claim 47 or claim 48, wherein the allelic
variant of a N-acetylglucosaminyl transferase component GPI-1 (GPI-1)
gene is at position 2577.

50. The method of claim 45 or 48, wherein the other gene
associated with cardiovascular disease is selected from the group of
genes consisting of N-acetylglucosaminyl transferase component GPI-1
(GPI-1) gene, cholesterol ester transfer protein, plasma (CETP);
apolipoprotein A-IV (APO A4); apolipoprotein A-I (APO A1); apolipoprotein
E (APO E); apolipoprotein B (APO B); apolipoprotein C-III (APO C3); a
gene encoding lipoprotein lipase (LPL); ATP-binding cassette transporter
(ABC 1); paraoxonase 1 (PON 1); paraoxonase 2 (PON 2);
5,10-methylenetetrahydrofolate reductase (MTHFR); a gene encoding hepatic
lipase, E-selectin, G protein beta 3 subunit and angiotensin II type 1
receptor gene.

51. A primer or probe that specifically hybridizes adjacent to or
at a polymorphic region of a cytochrome C oxidase subunit VIb (COX6B)
gene associated with high serum cholesterol in combination with a primer
or probe that specifically hybridizes adjacent to or at a polymorphic region
of a N-acetylglucosaminyl transferase component GPI-1 (GPI-1) gene
associated with low HDL.

52. The primers or probes of claim 51, further comprising
primers or probes that specifically hybridize adjacent to or at a
polymorphic region of another gene associated with cardiovascular
disease.

53. The primers or probes of claim 51, wherein the polymorphic
region of the cytochrome C oxidase subunit VIb (COX6B) gene comprises
nucleotide 86 of the coding strand and the polymorphic region of the
N-acetylglucosaminyl transferase component GPI-1 (GPI-1) gene comprises
nucleotide 2577.

54. The primers or probes of claim 52, wherein the
other gene associated with cardiovascular disease is selected from the
group of genes consisting of cholesterol ester transfer protein, plasma
(CETP); apolipoprotein A-IV (APO A4); apolipoprotein A-I (APO A1);
apolipoprotein E (APO E); apolipoprotein B (APO B); apolipoprotein C-III
(APO C3); a gene encoding lipoprotein lipase (LPL); ATP-binding cassette
transporter (ABC 1); paraoxonase 1 (PON 1); paraoxonase 2 (PON 2);
5,10-methylenetetrahydrofolate reductase (MTHFR); a gene encoding
hepatic lipase, E-selectin, G protein beta 3 subunit and angiotensin II type
1 receptor gene.

55. A kit for indicating whether a subject has a predisposition to
developing cardiovascular disease, comprising:

(a) at least one probe or primer that specifically hybridizes
adjacent to or at a polymorphic region of a cytochrome C oxidase
subunit VIb (COX6B) gene associated with high serum cholesterol;
and

(b) optionally instructions for use.

56. The kit of claim 55, wherein the polymorphic region
comprises nucleotide 86 of the coding strand.

57. A kit for indicating whether a subject has a predisposition to
developing cardiovascular disease, comprising:

(a) at least one probe or primer that specifically hybridizes
adjacent to or at a polymorphic region of a cytochrome C oxidase
subunit VIb (COX6B) gene associated with high cholesterol;

(b) at least one probe or primer that specifically hybridizes
adjacent to or at a polymorphic region of another gene associated
with cardiovascular disease; and

(c) optionally instructions for use.

58. The kit of claim 57, wherein the other gene associated with
cardiovascular disease is selected from the group of genes consisting of
N-acetylglucosaminyl transferase component GPI-1 (GPI-1); cholesterol
ester transfer protein, plasma (CETP); apolipoprotein A-IV (APO A4);
apolipoprotein A-I (APO A1); apolipoprotein E (APO E); apolipoprotein B
(APO B); apolipoprotein C-III (APO C3); a gene encoding lipoprotein lipase
(LPL); ATP-binding cassette transporter (ABC 1); paraoxonase 1 (PON 1);
paraoxonase 2 (PON 2); 5,10-methylenetetrahydrofolate reductase
(MTHFR); a gene encoding hepatic lipase, E-selectin, G protein beta 3
subunit and angiotensin II type 1 receptor gene.

59. A kit for indicating whether a subject has a predisposition to
developing cardiovascular disease, comprising:

(a) at least one probe or primer that specifically hybridizes
adjacent to or at a polymorphic region of a N-acetylglucosaminyl
transferase component GPI-1 (GPI-1) gene associated with low
serum high density lipoprotein (HDL); and

(b) optionally instructions for use.

60. The kit of claim 59, wherein the polymorphic region
comprises nucleotide 2577 of the coding strand.

61. A kit for indicating whether a subject has a predisposition to
developing cardiovascular disease, comprising:

(a) at least one probe or primer that specifically hybridizes
adjacent to or at a polymorphic region of a N-acetylglucosaminyl
transferase component GPI-1 (GPI-1) gene associated with low
serum high density lipoprotein (HDL);

(b) at least one probe or primer that specifically hybridizes
adjacent to or at a polymorphic region of another gene associated
with cardiovascular disease; and

(c) optionally instructions for use.

62. The kit of claim 61, wherein the other gene associated with
cardiovascular disease is selected from the group of genes consisting of
cytochrome C oxidase subunit VIb (COX6B); cholesterol ester transfer
protein, plasma (CETP); apolipoprotein A-IV (APO A4); apolipoprotein A-I
(APO A1); apolipoprotein E (APO E); apolipoprotein B (APO B);
apolipoprotein C-III (APO C3); a gene encoding lipoprotein lipase (LPL);
ATP-binding cassette transporter (ABC 1); paraoxonase 1 (PON 1);
paraoxonase 2 (PON 2); 5,10-methylenetetrahydrofolate reductase
(MTHFR); a gene encoding hepatic lipase, E-selectin, G protein beta 3
subunit and angiotensin II type 1 receptor gene.

63. A kit for indicating whether a subject has a predisposition to
developing cardiovascular disease, comprising:

(a) at least one probe or primer that specifically hybridizes
adjacent to or at a polymorphic region of a cytochrome C oxidase
subunit VIb (COX6B) gene associated with high cholesterol;

(b) at least one probe or primer that specifically hybridizes
adjacent to or at a polymorphic region of a N-acetylglucosaminyl
transferase component GPI-1 (GPI-1) gene associated with low
HDL; and

(c) optionally instructions for use.

64. The kit of claim 63, further comprising at least one probe or
primer that specifically hybridizes adjacent to or at a polymorphic region
of another gene associated with cardiovascular disease.

65. The kit of claim 64, wherein the other gene associated with
cardiovascular disease is selected from the group of genes consisting of
cholesterol ester transfer protein, plasma (CETP); apolipoprotein A-IV
(APO A4); apolipoprotein A-I (APO A1); apolipoprotein E (APO E);
apolipoprotein B (APO B); apolipoprotein C-III (APO C3); a gene encoding
lipoprotein lipase (LPL); ATP-binding cassette transporter (ABC 1);
paraoxonase 1 (PON 1); paraoxonase 2 (PON 2);
5,10-methylenetetrahydrofolate reductase (MTHFR); a gene encoding hepatic
lipase, E-selectin, G protein beta 3 subunit and angiotensin II type 1
receptor gene.

66. A method of diagnosing a predisposition to cardiovascular
disease in a human, said method comprising the steps of:

(a) obtaining a biological sample from the human;

(b) isolating DNA from the biological sample; and

(c) detecting the presence or absence of at least one
allelic variant of a cytochrome C oxidase subunit VIb (COX6B) gene
in the DNA.

67. The method of claim 66, wherein at least one variant is a C
to T transversion at position 86 of the cytochrome C oxidase subunit VIb
gene (COX6B) coding region.

68. The method of claim 66, further comprising the step of:
detecting the presence or absence of at least one allelic
variant of a second gene associated with cardiovascular disease.

69. The method of claim 68, wherein the second gene is
selected from the group consisting of human N-acetylglucosaminyl
transferase component GPI-1 (GPI-1); cholesterol ester transfer protein,
plasma (CETP); apolipoprotein A-IV (APO A4); apolipoprotein A-I (APO
A1); apolipoprotein E (APO E); apolipoprotein B (APO B); apolipoprotein
C-III (APO C3); a gene encoding lipoprotein lipase (LPL); ATP-binding
cassette transporter (ABC 1); paraoxonase 1 (PON 1); paraoxonase 2
(PON 2); 5,10-methylenetetrahydrofolate reductase (MTHFR); a gene
encoding hepatic lipase, E-selectin, G protein beta 3 subunit and
angiotensin II type 1 receptor gene.

70. The method of claim 68, wherein the detecting step is
performed by an assay selected from the group consisting of allele
specific hybridization, primer specific extension, oligonucleotide ligation,
restriction enzyme site analysis, and single-stranded conformation
polymorphism analysis.

71. A method of diagnosing a predisposition to cardiovascular
disease in a human, said method comprising the steps of:

(a) obtaining a biological sample from the human;

(b) isolating DNA from the biological sample; and

(c) detecting the presence or absence of at least one
allelic variant of a N-acetylglucosaminyl transferase component
GPI-1 (GPI-1) gene in the DNA.

72. The method of claim 71, wherein the detecting step is
performed by an assay selected from the group consisting of allele
specific hybridization, primer specific extension, oligonucleotide ligation,
restriction enzyme site analysis, and single-stranded conformation
polymorphism analysis.

73. The method of claim 71, wherein at least one variant is a G
to A transversion at position 2577 of a N-acetylglucosaminyl transferase
component GPI-1 (GPI-1) gene.

74. A method of determining a response of a human to a
cardiovascular drug, said method comprising the steps of:

(a) obtaining a biological sample from the human;

(b) isolating DNA from the biological sample; and

(c) detecting the presence or absence of at least one
allelic variant of a cytochrome C oxidase subunit VIb (COX6B) gene
in the DNA or at least one allelic variant of a N-acetylglucosaminyl
transferase component GPI-1 (GPI-1) gene in the DNA.

75. The method of claim 74, wherein the detecting step is
performed by an assay selected from the group consisting of allele
specific hybridization, primer specific extension, oligonucleotide ligation,
restriction enzyme site analysis, and single-stranded conformation
polymorphism analysis.
20 76. A microarray, comprising: 

an isolated nucleic acid molecule comprising a sequence of 
nucleotides of a polymorphic region from a human cytochrome C oxidase 
subunit VIb (C0X6B) gene linked to a solid support. 

77. The microarray of claim 76, wherein the polymorphic region 
25 comprises position 86 of the human cytochrome C oxidase subunit VIb 

(C0X6B) coding region. 

78. A microarray, comprising: 

an isolated nucleic acid molecule comprising a sequence of
nucleotides of a polymorphic region from a human N-acetylglucosaminyl
transferase component GPI-1 (GPI-1) gene linked to a
solid support.

79. The microarray of claim 78, wherein the polymorphic region
comprises a locus selected from the group consisting of position 2577 of
the human N-acetylglucosaminyl transferase component GPI-1 (GPI-1)
gene, position 2829 of the human GPI-1 gene, position 2519 of the
human GPI-1 gene, position 2289 of the human GPI-1 gene, position
1938 of the human GPI-1 gene, position 1563 of the human GPI-1 gene,
position 2656 of the human GPI-1 gene, and position 2664 of the human
GPI-1 gene.

80. The microarray of claim 91, wherein the polymorphic region 
comprises position 2577 of the human N-acetylglucosaminyl transferase 
component GPI-1 (GPI-1) gene. 
SEQUENCE LISTING

<110> Braun, Andreas 
Bansal, Aruna
Kleyn, Patrick

<120> GENES AND POLYMORPHISMS ASSOCIATED WITH
CARDIOVASCULAR DISEASE AND THEIR USE 

<130> 24736-2048PC 

<140> Unassigned 
<141> Herewith 

<150> 09/802,640
<151> 03-09-2001

<160> 122 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 439 

<212> DNA 

<213> Homo sapiens

<220> 

<221> CDS 

<222> (45)...(305)

<400> 1 

ttgagctgca ggttgaatcc ggggtgcctt taggattcag cacc atg gcg gaa gac 56 

Met Ala Glu Asp 
1 

atg gag acc aaa atc aag aac tac aag acc gcc cct ttt gac agc cgc 104
Met Glu Thr Lys Ile Lys Asn Tyr Lys Thr Ala Pro Phe Asp Ser Arg
5 10 15 20

ttc ccc aac cag aac cag act aga aac tgc tgg cag aac tac ctg gac 152
Phe Pro Asn Gln Asn Gln Thr Arg Asn Cys Trp Gln Asn Tyr Leu Asp
25 30 35

ttc cac cgc tgt cag aag gca atg acc gct aaa gga ggc gat atc tct 200
Phe His Arg Cys Gln Lys Ala Met Thr Ala Lys Gly Gly Asp Ile Ser
40 45 50

gtg tgc gaa tgg tac cag cgt gtg tac cag tcc ctc tgc ccc aca tcc 248
Val Cys Glu Trp Tyr Gln Arg Val Tyr Gln Ser Leu Cys Pro Thr Ser
55 60 65

tgg gtc aca gac tgg gat gag caa cgg gct gaa ggc acg ttt ccc ggg 296
Trp Val Thr Asp Trp Asp Glu Gln Arg Ala Glu Gly Thr Phe Pro Gly
70 75 80

aag atc tga actggctgca tctccctttc ctctgtcctc catccttctc 345
Lys Ile *
85

ccaggatggt gaagggggac ctggtaccca gtgatcccca ccccaggatc ctaaatcatg 405 
acttacctgc taataaaaac tcattggaaa agtg 439 

<210> 2 

<211> 86 

<212> PRT 

<213> Homo sapiens

<400> 2 

Met Ala Glu Asp Met Glu Thr Lys Ile Lys Asn Tyr Lys Thr Ala Pro
1 5 10 15

Phe Asp Ser Arg Phe Pro Asn Gln Asn Gln Thr Arg Asn Cys Trp Gln
20 25 30

Asn Tyr Leu Asp Phe His Arg Cys Gln Lys Ala Met Thr Ala Lys Gly
35 40 45

Gly Asp Ile Ser Val Cys Glu Trp Tyr Gln Arg Val Tyr Gln Ser Leu
50 55 60

Cys Pro Thr Ser Trp Val Thr Asp Trp Asp Glu Gln Arg Ala Glu Gly
65 70 75 80

Thr Phe Pro Gly Lys Ile
85

<210> 3 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer
<400> 3 

aggattcagc accatggc 18 

<210> 4 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 4 

agcggataac aatttcacac aggtagtctg gttctggttg ggg 43 

<210> 5 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> MassExtend primer 
<400> 5 

aatcaagaac tacaagac 18 


<210> 6 
<211> 2921

<212> DNA 

<213> Homo sapiens

<220> 
<221> CDS 

<222> (103)...(1848)
<400> 6 

cagcgagcgc cgtcgtctgc ccgggcccgc ccatcggggt ccccaacccc atccggaccc 60 
cgccgcccga gcgcgcggcc ccggaagcac ccgcctcccg gc atg gtg ctc aag 114
Met Val Leu Lys
1

gcc ttc ttc ccc acg tgc tgc gtc tcg gcg gac agc ggg ctg ctg gtg 162
Ala Phe Phe Pro Thr Cys Cys Val Ser Ala Asp Ser Gly Leu Leu Val
5 10 15 20

gga cgg tgg gtg ccg gag cag agc agc gcc gtg gtc ctg gcg gtc ctg 210
Gly Arg Trp Val Pro Glu Gln Ser Ser Ala Val Val Leu Ala Val Leu
25 30 35

cac ttt ccc ttc atc ccc atc cag gtc aag cag ctc ctg gcc cag gtg 258
His Phe Pro Phe Ile Pro Ile Gln Val Lys Gln Leu Leu Ala Gln Val
40 45 50

cgg cag gcc agc cag gtg ggc gtg gcc gtg ctg ggc acc tgg tgc cac 306
Arg Gln Ala Ser Gln Val Gly Val Ala Val Leu Gly Thr Trp Cys His
55 60 65

tgc cgg cag gag ccc gag gag agc ctg ggc cgc ttc ctg gag agc ctg 354
Cys Arg Gln Glu Pro Glu Glu Ser Leu Gly Arg Phe Leu Glu Ser Leu
70 75 80

ggt gct gtc ttc ccc cat gag ccc tgg ctg cgg ctg tgc cgg gag aga 402
Gly Ala Val Phe Pro His Glu Pro Trp Leu Arg Leu Cys Arg Glu Arg
85 90 95 100

ggc ggc acg ttc tgg agc tgc gag gcc acc cac cgg caa gcg ccc act 450
Gly Gly Thr Phe Trp Ser Cys Glu Ala Thr His Arg Gln Ala Pro Thr
105 110 115

gcc ccc ggt gcc cct ggt gag gac cag gtc atg ctc atc ttc tat gac 498
Ala Pro Gly Ala Pro Gly Glu Asp Gln Val Met Leu Ile Phe Tyr Asp
120 125 130

cag cgc cag gtg ttg ctg tca cag cta cac ctg ccc acc gtc ctg ccc 546
Gln Arg Gln Val Leu Leu Ser Gln Leu His Leu Pro Thr Val Leu Pro
135 140 145

gac cgc cag gct gga gcc acc act gcc agc acg ggg ggc ctg gct gcc 594
Asp Arg Gln Ala Gly Ala Thr Thr Ala Ser Thr Gly Gly Leu Ala Ala
150 155 160

gtc ttc gac acg gta gca cgc agt gag gtg ctc ttc cgc agt gac cgc 642
Val Phe Asp Thr Val Ala Arg Ser Glu Val Leu Phe Arg Ser Asp Arg
165 170 175 180

ttt gat gag ggc ccc gtg cgg ctg agc cac tgg cag tcg gag ggc gtg 690
Phe Asp Glu Gly Pro Val Arg Leu Ser His Trp Gln Ser Glu Gly Val
185 190 195

gag gcc agc atc ctc gcg gag ctg gcc agg cga gcc tcg gga ccc att 738
Glu Ala Ser Ile Leu Ala Glu Leu Ala Arg Arg Ala Ser Gly Pro Ile
200 205 210

tgt ctg ctg ttg gcc agc ctg ctg tcg ctg gtc tca gct gtc agt gcc 786
Cys Leu Leu Leu Ala Ser Leu Leu Ser Leu Val Ser Ala Val Ser Ala
215 220 225

tgc cga gtg ttc aag ctc tgg ccc ctg tcc ttc ctc ggg agc aaa ctc 834
Cys Arg Val Phe Lys Leu Trp Pro Leu Ser Phe Leu Gly Ser Lys Leu
230 235 240

tcc acg tgc gaa cag ctc cgg cac cgg ctg gag cac ctc acg cta atc 882
Ser Thr Cys Glu Gln Leu Arg His Arg Leu Glu His Leu Thr Leu Ile
245 250 255 260

ttc agt aca cgg aag gcg gag aac cct gcc cag ctg atg agg aag gcc 930
Phe Ser Thr Arg Lys Ala Glu Asn Pro Ala Gln Leu Met Arg Lys Ala
265 270 275

aac acg gtg gcc tct gtg ctg ctg gac gtg gcc ctg ggc ctc atg ctg 978
Asn Thr Val Ala Ser Val Leu Leu Asp Val Ala Leu Gly Leu Met Leu
280 285 290

ctg tcc tgg ctc cac ggg aga agc cgc atc ggg cat ctg gcc gac gcc 1026
Leu Ser Trp Leu His Gly Arg Ser Arg Ile Gly His Leu Ala Asp Ala
295 300 305

ctc gtt cct gtg gct gac cac gtg gcc gag gag ctc cag cat ctg ctg 1074
Leu Val Pro Val Ala Asp His Val Ala Glu Glu Leu Gln His Leu Leu
310 315 320

cag tgg ctg atg ggt gct ccc gcc ggg ctc aag atg aac cgt gca ctg 1122
Gln Trp Leu Met Gly Ala Pro Ala Gly Leu Lys Met Asn Arg Ala Leu
325 330 335 340

gac cag gtg ctg ggc cgc ttc ttc ctc tac cac atc cac ctg tgg atc 1170
Asp Gln Val Leu Gly Arg Phe Phe Leu Tyr His Ile His Leu Trp Ile
345 350 355

agc tac atc cac ctc atg tcc ccc ttc gtg gag cac atc ctt tgg cac 1218
Ser Tyr Ile His Leu Met Ser Pro Phe Val Glu His Ile Leu Trp His
360 365 370

gtg ggc ctc tcg gcc tgc ctg ggc ctg acg gtg gcc ctg tcc ctc ctc 1266
Val Gly Leu Ser Ala Cys Leu Gly Leu Thr Val Ala Leu Ser Leu Leu
375 380 385

tcg gac att atc gcc ctc ctc acc ttc cac atc tac tgc ttt tac gtc 1314
Ser Asp Ile Ile Ala Leu Leu Thr Phe His Ile Tyr Cys Phe Tyr Val
390 395 400

tat gga gcc agg ctg tac tgc ctg aag atc cat ggc ctg tcc tca ctg 1362
Tyr Gly Ala Arg Leu Tyr Cys Leu Lys Ile His Gly Leu Ser Ser Leu
405 410 415 420


tgg cgt ctg ttc cgg ggg aag aag tgg aac gtt ctg cgc cag cgc gtg 1410
Trp Arg Leu Phe Arg Gly Lys Lys Trp Asn Val Leu Arg Gln Arg Val
425 430 435

gac tcc tgt tcc tat gac ctg gac cag ctg ttc atc ggg act ctg ctc 1458
Asp Ser Cys Ser Tyr Asp Leu Asp Gln Leu Phe Ile Gly Thr Leu Leu
440 445 450

ttc acc atc ctg ctc ttc ctc ctg cct acc aca gcc ctg tac tac ctg 1506
Phe Thr Ile Leu Leu Phe Leu Leu Pro Thr Thr Ala Leu Tyr Tyr Leu
455 460 465

gtg ttc acc ctg ctc cgg ctc ctg gtg gtc gcc gtg cag ggc ctg atc 1554
Val Phe Thr Leu Leu Arg Leu Leu Val Val Ala Val Gln Gly Leu Ile
470 475 480

cat ctg ctg gtg gac ctc atc aac tcc ctg ccg ctg tac tca ctg ggt 1602
His Leu Leu Val Asp Leu Ile Asn Ser Leu Pro Leu Tyr Ser Leu Gly
485 490 495 500

ctt cgg ctc tgc cgg ccc tac agg ctg gcg gct ggc gtg aag ttc cgt 1650
Leu Arg Leu Cys Arg Pro Tyr Arg Leu Ala Ala Gly Val Lys Phe Arg
505 510 515

gtc ctc cgg cac gag gcc agc agg ccc ctc cgc ctc ctg atg cag ata 1698
Val Leu Arg His Glu Ala Ser Arg Pro Leu Arg Leu Leu Met Gln Ile
520 525 530

aac cca ctg ccc tac agc cgc gtg gtg cac acc tac cgc ctc ccc agc 1746
Asn Pro Leu Pro Tyr Ser Arg Val Val His Thr Tyr Arg Leu Pro Ser
535 540 545

tgt ggc tgc cac ccc aag cac tcc tgg ggc gcc ctg tgc cgc aag ctg 1794
Cys Gly Cys His Pro Lys His Ser Trp Gly Ala Leu Cys Arg Lys Leu
550 555 560

ttc ctt ggg gag ctc atc tac ccc tgg agg cag aga ggg gac aag cag 1842
Phe Leu Gly Glu Leu Ile Tyr Pro Trp Arg Gln Arg Gly Asp Lys Gln
565 570 575 580


gac tga gggaactgct ggctcgcctg gcaccaccac acggccacag ccagccatct 1898
Asp *

gctctgccag ggtggcacca gctcagctgg cgcatgtccc gtgctttgtg gacgctgctg 1958
tgtgctcctg aacacggcag gccctgctat cacaccttgg gcttggaggt cattgggagt 2018
gagcagatgt gggggtggcc agccaggctg gccgcactcc atcactggca ctgcctgcct 2078
tgggacccgc ttcccacctg ctgcggtcac catggtggcg agcacagcaa ccccaggtgt 2138
ccagagcact gccccatgcc caccctgcat acccaggtcc agagggtccg tccaccacag 2198
cagccccagg tggagggctg gtctccctgg gggctcccca gtggctctgc cctggctgtg 2258
ggggtggagg gaccttgcca ggatgaaccc tccagtccca ggcaccctct agctccctca 2318
gccgaacagc accctgcatc tgggggattg aagcagtcgc tgacccccgt ccccagcggg 2378
cccgggccct cactccctga accacacggg gtttatttgc ggatgttccc tggagaggtc 2438
gctttgtgaa gaaaccatca gcaggctgtg agcatcgcca ggctgctgtg ggggcgggag 2498
cagcctcagt gtcaagggcc tgcccactga cccagccgta cctattcgtc cacggtgccc 2558
cgtagcagca ggtcctgcgg ccaaatctgt ctcccttcat gggcctccca gggaaggagg 2618
aagccctgct gtgcagacac ctctgtggcc ccccaggggt gtgagcggcc tggggagggg 2678
gccgtggcac tgaggccgaa agtgcctgcc agacggcacg gtctgggtgc gggtgttccc 2738
tgtgagcccg agtccgcttc aggaggggag cctgcaggtg ccggctggtg aggggatgac 2798
gcgctgtggg tgggaggagg cagcgcccat ctcagcagca ccaggactgc ctgggactcc 2858
ctggcaaccc agcaccgggg aagccgtcag ctgctgtgac aataaaacct gccccgtgtc 2918
tgg 2921

<210> 7 

<211> 581 

<212> PRT 

<213> Homo sapiens

<400> 7 

Met Val Leu Lys Ala Phe Phe Pro Thr Cys Cys Val Ser Ala Asp Ser
1 5 10 15

Gly Leu Leu Val Gly Arg Trp Val Pro Glu Gln Ser Ser Ala Val Val
20 25 30

Leu Ala Val Leu His Phe Pro Phe Ile Pro Ile Gln Val Lys Gln Leu
35 40 45

Leu Ala Gln Val Arg Gln Ala Ser Gln Val Gly Val Ala Val Leu Gly
50 55 60

Thr Trp Cys His Cys Arg Gln Glu Pro Glu Glu Ser Leu Gly Arg Phe
65 70 75 80

Leu Glu Ser Leu Gly Ala Val Phe Pro His Glu Pro Trp Leu Arg Leu
85 90 95

Cys Arg Glu Arg Gly Gly Thr Phe Trp Ser Cys Glu Ala Thr His Arg
100 105 110

Gln Ala Pro Thr Ala Pro Gly Ala Pro Gly Glu Asp Gln Val Met Leu
115 120 125

Ile Phe Tyr Asp Gln Arg Gln Val Leu Leu Ser Gln Leu His Leu Pro
130 135 140

Thr Val Leu Pro Asp Arg Gln Ala Gly Ala Thr Thr Ala Ser Thr Gly
145 150 155 160

Gly Leu Ala Ala Val Phe Asp Thr Val Ala Arg Ser Glu Val Leu Phe
165 170 175

Arg Ser Asp Arg Phe Asp Glu Gly Pro Val Arg Leu Ser His Trp Gln
180 185 190

Ser Glu Gly Val Glu Ala Ser Ile Leu Ala Glu Leu Ala Arg Arg Ala
195 200 205

Ser Gly Pro Ile Cys Leu Leu Leu Ala Ser Leu Leu Ser Leu Val Ser
210 215 220

Ala Val Ser Ala Cys Arg Val Phe Lys Leu Trp Pro Leu Ser Phe Leu
225 230 235 240

Gly Ser Lys Leu Ser Thr Cys Glu Gln Leu Arg His Arg Leu Glu His
245 250 255

Leu Thr Leu Ile Phe Ser Thr Arg Lys Ala Glu Asn Pro Ala Gln Leu
260 265 270

Met Arg Lys Ala Asn Thr Val Ala Ser Val Leu Leu Asp Val Ala Leu
275 280 285

Gly Leu Met Leu Leu Ser Trp Leu His Gly Arg Ser Arg Ile Gly His
290 295 300

Leu Ala Asp Ala Leu Val Pro Val Ala Asp His Val Ala Glu Glu Leu
305 310 315 320

Gln His Leu Leu Gln Trp Leu Met Gly Ala Pro Ala Gly Leu Lys Met
325 330 335

Asn Arg Ala Leu Asp Gln Val Leu Gly Arg Phe Phe Leu Tyr His Ile
340 345 350

His Leu Trp Ile Ser Tyr Ile His Leu Met Ser Pro Phe Val Glu His
355 360 365

Ile Leu Trp His Val Gly Leu Ser Ala Cys Leu Gly Leu Thr Val Ala
370 375 380

Leu Ser Leu Leu Ser Asp Ile Ile Ala Leu Leu Thr Phe His Ile Tyr
385 390 395 400

Cys Phe Tyr Val Tyr Gly Ala Arg Leu Tyr Cys Leu Lys Ile His Gly
405 410 415

Leu Ser Ser Leu Trp Arg Leu Phe Arg Gly Lys Lys Trp Asn Val Leu
420 425 430

Arg Gln Arg Val Asp Ser Cys Ser Tyr Asp Leu Asp Gln Leu Phe Ile
435 440 445

Gly Thr Leu Leu Phe Thr Ile Leu Leu Phe Leu Leu Pro Thr Thr Ala
450 455 460

Leu Tyr Tyr Leu Val Phe Thr Leu Leu Arg Leu Leu Val Val Ala Val
465 470 475 480

Gln Gly Leu Ile His Leu Leu Val Asp Leu Ile Asn Ser Leu Pro Leu
485 490 495

Tyr Ser Leu Gly Leu Arg Leu Cys Arg Pro Tyr Arg Leu Ala Ala Gly
500 505 510

Val Lys Phe Arg Val Leu Arg His Glu Ala Ser Arg Pro Leu Arg Leu
515 520 525

Leu Met Gln Ile Asn Pro Leu Pro Tyr Ser Arg Val Val His Thr Tyr
530 535 540

Arg Leu Pro Ser Cys Gly Cys His Pro Lys His Ser Trp Gly Ala Leu
545 550 555 560

Cys Arg Lys Leu Phe Leu Gly Glu Leu Ile Tyr Pro Trp Arg Gln Arg
565 570 575

Gly Asp Lys Gln Asp
580


<210> 8 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 8 

agcagggctt cctccttc 18 

<210> 9 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 9 

agcggataac aatttcacac aggtgaccca gccgtaccta ttc 43 

<210> 10 
<211> 18 
<212> DNA 

<213> Artificial Sequence 


<220> 
<223> MassExtend primer
<400> 10 

aagggagaca gatttggc 18 

<210> 11 

<211> 1790 

<212> DNA 

<213> Homo sapiens

<220> 
<221> CDS 

<222> (131)...(1612)

<223> Nucleotide sequence encoding Cholesterol ester 
transfer protein (CETP) 

<400> 11 

gtgaatctct ggggccagga agaccctgct gcccggaaga gcctcatgtt ccgtgggggc 60 
tgggcggaca tacatatacg ggctccaggc tgaacggctc gggccactta cacaccactg 120 
cctgataacc atg ctg gct gcc aca gtc ctg acc ctg gcc ctg ctg ggc 169
Met Leu Ala Ala Thr Val Leu Thr Leu Ala Leu Leu Gly
1 5 10

aat gcc cat gcc tgc tcc aaa ggc acc tcg cac gag gca ggc atc gtg 217
Asn Ala His Ala Cys Ser Lys Gly Thr Ser His Glu Ala Gly Ile Val
15 20 25

tgc cgc atc acc aag cct gcc ctc ctg gtg ttg aac cac gag act gcc 265
Cys Arg Ile Thr Lys Pro Ala Leu Leu Val Leu Asn His Glu Thr Ala
30 35 40 45

aag gtg atc cag acc gcc ttc cag cga gcc agc tac cca gat atc acg 313
Lys Val Ile Gln Thr Ala Phe Gln Arg Ala Ser Tyr Pro Asp Ile Thr
50 55 60

ggc gag aag gcc atg atg ctc ctt ggc caa gtc aag tat ggg ttg cac 361
Gly Glu Lys Ala Met Met Leu Leu Gly Gln Val Lys Tyr Gly Leu His
65 70 75

aac atc cag atc agc cac ttg tcc atc gcc agc agc cag gtg gag ctg 409
Asn Ile Gln Ile Ser His Leu Ser Ile Ala Ser Ser Gln Val Glu Leu
80 85 90

gtg gaa gcc aag tcc att gat gtc tcc att cag aac gtg tct gtg gtc 457
Val Glu Ala Lys Ser Ile Asp Val Ser Ile Gln Asn Val Ser Val Val
95 100 105

ttc aag ggg acc ctg aag tat ggc tac acc act gcc tgg tgg ctg ggt 505
Phe Lys Gly Thr Leu Lys Tyr Gly Tyr Thr Thr Ala Trp Trp Leu Gly
110 115 120 125

att gat cag tcc att gac ttc gag atc gac tct gcc att gac ctc cag 553
Ile Asp Gln Ser Ile Asp Phe Glu Ile Asp Ser Ala Ile Asp Leu Gln
130 135 140

atc aac aca cag ctg acc tgt gac tct ggt aga gtg cgg acc gat gcc 601
Ile Asn Thr Gln Leu Thr Cys Asp Ser Gly Arg Val Arg Thr Asp Ala
145 150 155

cct gac tgc tac ctg tct ttc cat aag ctg ctc ctg cat ctc caa ggg 649
Pro Asp Cys Tyr Leu Ser Phe His Lys Leu Leu Leu His Leu Gln Gly
160 165 170

gag cga gag cct ggg tgg atc aag cag ctg ttc aca aat ttc atc tcc 697
Glu Arg Glu Pro Gly Trp Ile Lys Gln Leu Phe Thr Asn Phe Ile Ser
175 180 185

ttc acc ctg aag ctg gtc ctg aag gga cag atc tgc aaa gag atc aac 745
Phe Thr Leu Lys Leu Val Leu Lys Gly Gln Ile Cys Lys Glu Ile Asn
190 195 200 205

gtc atc tct aac atc atg gcc gat ttt gtc cag aca agg gct gcc agc 793
Val Ile Ser Asn Ile Met Ala Asp Phe Val Gln Thr Arg Ala Ala Ser
210 215 220

atc ctt tca gat gga gac att ggg gtg gac att tcc ctg aca ggt gat 841
Ile Leu Ser Asp Gly Asp Ile Gly Val Asp Ile Ser Leu Thr Gly Asp
225 230 235

ccc gtc atc aca gcc tcc tac ctg gag tcc cat cac aag ggt cat ttc 889
Pro Val Ile Thr Ala Ser Tyr Leu Glu Ser His His Lys Gly His Phe
240 245 250

atc tac aag aat gtc tca gag gac ctc ccc ctc ccc acc ttc tcg ccc 937
Ile Tyr Lys Asn Val Ser Glu Asp Leu Pro Leu Pro Thr Phe Ser Pro
255 260 265

aca ctg ctg ggg gac tcc cgc atg ctg tac ttc tgg ttc tct gag cga 985
Thr Leu Leu Gly Asp Ser Arg Met Leu Tyr Phe Trp Phe Ser Glu Arg
270 275 280 285

gtc ttc cac tcg ctg gcc aag gta gct ttc cag gat ggc cgc ctc atg 1033
Val Phe His Ser Leu Ala Lys Val Ala Phe Gln Asp Gly Arg Leu Met
290 295 300

ctc agc ctg atg gga gac gag ttc aag gca gtg ctg gag acc tgg ggc 1081
Leu Ser Leu Met Gly Asp Glu Phe Lys Ala Val Leu Glu Thr Trp Gly
305 310 315

ttc aac acc aac cag gaa atc ttc caa gag gtt gtc ggc ggc ttc ccc 1129
Phe Asn Thr Asn Gln Glu Ile Phe Gln Glu Val Val Gly Gly Phe Pro
320 325 330

agc cag gcc caa gtc acc gtc cac tgc ctc aag atg ccc aag atc tcc 1177
Ser Gln Ala Gln Val Thr Val His Cys Leu Lys Met Pro Lys Ile Ser
335 340 345

tgc caa aac aag gga gtc gtg gtc aat tct tca gtg atg gtg aaa ttc 1225
Cys Gln Asn Lys Gly Val Val Val Asn Ser Ser Val Met Val Lys Phe
350 355 360 365

ctc ttt cca cgc cca gac cag caa cat tct gta gct tac aca ttt gaa 1273
Leu Phe Pro Arg Pro Asp Gln Gln His Ser Val Ala Tyr Thr Phe Glu
370 375 380

gag gat atc gtg act acc gtc cag gcc tcc tat tct aag aaa aag ctc 1321
Glu Asp Ile Val Thr Thr Val Gln Ala Ser Tyr Ser Lys Lys Lys Leu
385 390 395

ttc tta agc ctc ttg gat ttc cag att aca cca aag act gtt tcc aac 1369
Phe Leu Ser Leu Leu Asp Phe Gln Ile Thr Pro Lys Thr Val Ser Asn
400 405 410

ttg act gag agc agc tcc gag tcc atc cag agc ttc ctg cag tca atg 1417
Leu Thr Glu Ser Ser Ser Glu Ser Ile Gln Ser Phe Leu Gln Ser Met
415 420 425

atc acc gct gtg ggc atc cct gag gtc atg tct cgg ctc gag gta gtg 1465
Ile Thr Ala Val Gly Ile Pro Glu Val Met Ser Arg Leu Glu Val Val
430 435 440 445

ttt aca gcc ctc atg aac agc aaa ggc gtg agc ctc ttc gac atc atc 1513
Phe Thr Ala Leu Met Asn Ser Lys Gly Val Ser Leu Phe Asp Ile Ile
450 455 460

aac cct gag att atc act cga gat ggc ttc ctg ctg ctg cag atg gac 1561
Asn Pro Glu Ile Ile Thr Arg Asp Gly Phe Leu Leu Leu Gln Met Asp
465 470 475

ttt ggc ttc cct gag cac ctg ctg gtg gat ttc ctc cag agc ttg agc 1609
Phe Gly Phe Pro Glu His Leu Leu Val Asp Phe Leu Gln Ser Leu Ser
480 485 490

tag aagtctccaa ggaggtcggg atggggcttg tagcagaagg caagcaccag 1662

gctcacagct ggaaccctgg tgtctcctcc agcgtggtgg aagttgggtt aggagtacgg 1722
agatggagat tggctcccaa ctcctcccta tcctaaaggc ccactggcat taaagtgctg 1782
tatccaag 1790

<210> 12 

<211> 493 

<212> PRT 

<213> Homo sapiens


<400> 12 


Met Leu Ala Ala Thr Val Leu Thr Leu Ala Leu Leu Gly Asn Ala His
1 5 10 15

Ala Cys Ser Lys Gly Thr Ser His Glu Ala Gly Ile Val Cys Arg Ile
20 25 30

Thr Lys Pro Ala Leu Leu Val Leu Asn His Glu Thr Ala Lys Val Ile
35 40 45

Gln Thr Ala Phe Gln Arg Ala Ser Tyr Pro Asp Ile Thr Gly Glu Lys
50 55 60

Ala Met Met Leu Leu Gly Gln Val Lys Tyr Gly Leu His Asn Ile Gln
65 70 75 80

Ile Ser His Leu Ser Ile Ala Ser Ser Gln Val Glu Leu Val Glu Ala
85 90 95

Lys Ser Ile Asp Val Ser Ile Gln Asn Val Ser Val Val Phe Lys Gly
100 105 110

Thr Leu Lys Tyr Gly Tyr Thr Thr Ala Trp Trp Leu Gly Ile Asp Gln
115 120 125

Ser Ile Asp Phe Glu Ile Asp Ser Ala Ile Asp Leu Gln Ile Asn Thr
130 135 140

Gln Leu Thr Cys Asp Ser Gly Arg Val Arg Thr Asp Ala Pro Asp Cys
145 150 155 160

Tyr Leu Ser Phe His Lys Leu Leu Leu His Leu Gln Gly Glu Arg Glu
165 170 175

Pro Gly Trp Ile Lys Gln Leu Phe Thr Asn Phe Ile Ser Phe Thr Leu
180 185 190

Lys Leu Val Leu Lys Gly Gln Ile Cys Lys Glu Ile Asn Val Ile Ser
195 200 205

Asn Ile Met Ala Asp Phe Val Gln Thr Arg Ala Ala Ser Ile Leu Ser
210 215 220

Asp Gly Asp Ile Gly Val Asp Ile Ser Leu Thr Gly Asp Pro Val Ile
225 230 235 240

Thr Ala Ser Tyr Leu Glu Ser His His Lys Gly His Phe Ile Tyr Lys
245 250 255

Asn Val Ser Glu Asp Leu Pro Leu Pro Thr Phe Ser Pro Thr Leu Leu
260 265 270

Gly Asp Ser Arg Met Leu Tyr Phe Trp Phe Ser Glu Arg Val Phe His
275 280 285

Ser Leu Ala Lys Val Ala Phe Gln Asp Gly Arg Leu Met Leu Ser Leu
290 295 300

Met Gly Asp Glu Phe Lys Ala Val Leu Glu Thr Trp Gly Phe Asn Thr
305 310 315 320

Asn Gln Glu Ile Phe Gln Glu Val Val Gly Gly Phe Pro Ser Gln Ala
325 330 335

Gln Val Thr Val His Cys Leu Lys Met Pro Lys Ile Ser Cys Gln Asn
340 345 350

Lys Gly Val Val Val Asn Ser Ser Val Met Val Lys Phe Leu Phe Pro
355 360 365

Arg Pro Asp Gln Gln His Ser Val Ala Tyr Thr Phe Glu Glu Asp Ile
370 375 380

Val Thr Thr Val Gln Ala Ser Tyr Ser Lys Lys Lys Leu Phe Leu Ser
385 390 395 400

Leu Leu Asp Phe Gln Ile Thr Pro Lys Thr Val Ser Asn Leu Thr Glu
405 410 415

Ser Ser Ser Glu Ser Ile Gln Ser Phe Leu Gln Ser Met Ile Thr Ala
420 425 430

Val Gly Ile Pro Glu Val Met Ser Arg Leu Glu Val Val Phe Thr Ala
435 440 445

Leu Met Asn Ser Lys Gly Val Ser Leu Phe Asp Ile Ile Asn Pro Glu
450 455 460

Ile Ile Thr Arg Asp Gly Phe Leu Leu Leu Gln Met Asp Phe Gly Phe
465 470 475 480

Pro Glu His Leu Leu Val Asp Phe Leu Gln Ser Leu Ser
485 490


<210> 13 

<211> 3549 

<212> DNA 

<213> Homo sapiens

<220> 
<221> CDS 

<222> (175)...(1602)

<223> Nucleotide sequence encoding lipoprotein lipase 
(LPL) 
<400> 13

cccctcttcc tcctcctcaa gggaaagctg cccacttcta gctgccctgc catccccttt 60
aaagggcgac ttgctcagcg ccaaaccgcg gctccagccc tctccagcct ccggctcagc 120
cggctcatca gtcggtccgc gccttgcagc tcctccagag ggacgcgccc cgag atg 177
Met
1

gag agc aaa gcc ctg ctc gtg ctg act ctg gcc gtg tgg ctc cag agt 225
Glu Ser Lys Ala Leu Leu Val Leu Thr Leu Ala Val Trp Leu Gln Ser
5 10 15

ctg acc gcc tcc cgc gga ggg gtg gcc gcc gcc gac caa aga aga gat 273
Leu Thr Ala Ser Arg Gly Gly Val Ala Ala Ala Asp Gln Arg Arg Asp
20 25 30

ttt atc gac atc gaa agt aaa ttt gcc cta agg acc cct gaa gac aca 321
Phe Ile Asp Ile Glu Ser Lys Phe Ala Leu Arg Thr Pro Glu Asp Thr
35 40 45

gct gag gac act tgc cac ctc att ccc gga gta gca gag tcc gtg gct 369
Ala Glu Asp Thr Cys His Leu Ile Pro Gly Val Ala Glu Ser Val Ala
50 55 60 65

acc tgt cat ttc aat cac agc agc aaa acc ttc atg gtg atc cat ggc 417
Thr Cys His Phe Asn His Ser Ser Lys Thr Phe Met Val Ile His Gly
70 75 80

tgg acg gta aca gga atg tat gag agt tgg gtg cca aaa ctt gtg gcc 465
Trp Thr Val Thr Gly Met Tyr Glu Ser Trp Val Pro Lys Leu Val Ala
85 90 95

gcc ctg tac aag aga gaa cca gac tcc aat gtc att gtg gtg gac tgg 513
Ala Leu Tyr Lys Arg Glu Pro Asp Ser Asn Val Ile Val Val Asp Trp
100 105 110

ctg tca cgg gct cag gag cat tac cca gtg tcc gcg ggc tac acc aaa 561
Leu Ser Arg Ala Gln Glu His Tyr Pro Val Ser Ala Gly Tyr Thr Lys
115 120 125

ctg gtg gga cag gat gtg gcc cgg ttt atc aac tgg atg gag gag gag 609
Leu Val Gly Gln Asp Val Ala Arg Phe Ile Asn Trp Met Glu Glu Glu
130 135 140 145

ttt aac tac cct ctg gac aat gtc cat ctc ttg gga tac agc ctt gga 657
Phe Asn Tyr Pro Leu Asp Asn Val His Leu Leu Gly Tyr Ser Leu Gly
150 155 160

gcc cat gct gct ggc att gca gga agt ctg acc aat aag aaa gtc aac 705
Ala His Ala Ala Gly Ile Ala Gly Ser Leu Thr Asn Lys Lys Val Asn
165 170 175

aga att act ggc ctc gat cca gct gga cct aac ttt gag tat gca gaa 753
Arg Ile Thr Gly Leu Asp Pro Ala Gly Pro Asn Phe Glu Tyr Ala Glu
180 185 190

gcc ccg agt cgt ctt tct cct gat gat gca gat ttt gta gac gtc tta 801
Ala Pro Ser Arg Leu Ser Pro Asp Asp Ala Asp Phe Val Asp Val Leu
195 200 205
cac aca ttc acc aga ggg tcc cct ggt cga agc att gga atc cag aaa 849
His Thr Phe Thr Arg Gly Ser Pro Gly Arg Ser Ile Gly Ile Gln Lys
210 215 220 225

cca gtt ggg cat gtt gac att tac ccg aat gga ggt act ttt cag cca 897
Pro Val Gly His Val Asp Ile Tyr Pro Asn Gly Gly Thr Phe Gln Pro
230 235 240

gga tgt aac att gga gaa gct atc cgc gtg att gca gag aga gga ctt 945
Gly Cys Asn Ile Gly Glu Ala Ile Arg Val Ile Ala Glu Arg Gly Leu
245 250 255

gga gat gtg gac cag cta gtg aag tgc tcc cac gag cgc tcc att cat 993
Gly Asp Val Asp Gln Leu Val Lys Cys Ser His Glu Arg Ser Ile His
260 265 270

ctc ttc atc gac tct ctg ttg aat gaa gaa aat cca agt aag gcc tac 1041
Leu Phe Ile Asp Ser Leu Leu Asn Glu Glu Asn Pro Ser Lys Ala Tyr
275 280 285

agg tgc agt tcc aag gaa gcc ttt gag aaa ggg ctc tgc ttg agt tgt 1089
Arg Cys Ser Ser Lys Glu Ala Phe Glu Lys Gly Leu Cys Leu Ser Cys
290 295 300 305

aga aag aac cgc tgc aac aat ctg ggc tat gag atc aat aaa gtc aga 1137
Arg Lys Asn Arg Cys Asn Asn Leu Gly Tyr Glu Ile Asn Lys Val Arg
310 315 320

gcc aaa aga agc agc aaa atg tac ctg aag act cgt tct cag atg ccc 1185
Ala Lys Arg Ser Ser Lys Met Tyr Leu Lys Thr Arg Ser Gln Met Pro
325 330 335

tac aaa gtc ttc cat tac caa gta aag att cat ttt tct ggg act gag 1233
Tyr Lys Val Phe His Tyr Gln Val Lys Ile His Phe Ser Gly Thr Glu
340 345 350

agt gaa acc cat acc aat cag gcc ttt gag att tct ctg tat ggc acc 1281
Ser Glu Thr His Thr Asn Gln Ala Phe Glu Ile Ser Leu Tyr Gly Thr
355 360 365

gtg gcc gag agt gag aac atc cca ttc act ctg cct gaa gtt tcc aca 1329
Val Ala Glu Ser Glu Asn Ile Pro Phe Thr Leu Pro Glu Val Ser Thr
370 375 380 385

aat aag acc tac tcc ttc cta att tac aca gag gta gat att gga gaa 1377
Asn Lys Thr Tyr Ser Phe Leu Ile Tyr Thr Glu Val Asp Ile Gly Glu
390 395 400

cta ctc atg ttg aag ctc aaa tgg aag agt gat tca tac ttt agc tgg 1425
Leu Leu Met Leu Lys Leu Lys Trp Lys Ser Asp Ser Tyr Phe Ser Trp
405 410 415

tca gac tgg tgg agc agt ccc ggc ttc gcc att cag aag atc aga gta 1473
Ser Asp Trp Trp Ser Ser Pro Gly Phe Ala Ile Gln Lys Ile Arg Val
420 425 430

aaa gca gga gag act cag aaa aag gtg atc ttc tgt tct agg gag aaa 1521
Lys Ala Gly Glu Thr Gln Lys Lys Val Ile Phe Cys Ser Arg Glu Lys
435 440 445

gtg tct cat ttg cag aaa gga aag gca cct gcg gta ttt gtg aaa tgc 1569
Val Ser His Leu Gln Lys Gly Lys Ala Pro Ala Val Phe Val Lys Cys
450 455 460 465

cat gac aag tct ctg aat aag aag tca ggc tga aactgggcga atctacagaa 1622
His Asp Lys Ser Leu Asn Lys Lys Ser Gly *
470 475


caaagaacgg catgtgaatt ctgtgaagaa tgaagtggag gaagtaactt ttacaaaaca 1682
tacccagtgt ttggggtgtt tcaaaagtgg attttcctga atattaatcc cagccctacc 1742
cttgttagtt attttaggag acagtctcaa gcactaaaaa gtggctaatt caatttatgg 1802
ggtatagtgg ccaaatagca catcctccaa cgttaaaaga cagtggatca tgaaaagtgc 1862
tgttttgtcc tttgagaaag aaataattgt ttgagcgcag agtaaaataa ggctccttca 1922
tgtggcgtat tgggccatag cctataattg gttagaacct cctattttaa ttggaattct 1982
ggatctttcg gactgaggcc ttctcaaact ttactctaag tctccaagaa tacagaaaat 2042
gcttttccgc ggcacgaatc agactcatct acacagcagt atgaatgatg ttttagaatg 2102
attccctctt gctattggaa tgtggtccag acgtcaacca ggaacatgta acttggagag 2162
ggacgaagaa agggtctgat aaacacagag gttttaaaca gtccctacca ttggcctgca 2222
tcatgacaaa gttacaaatt caaggagata taaaatctag atcaattaat tcttaatagg 2282
ctttatcgtt tattgcttaa tccctctctc ccccttcttt tttgtctcaa gattatatta 2342
taataatgtt ctctgggtag gtgttgaaaa tgagcctgta atcctcagct gacacataat 2402
ttgaatggtg cagaaaaaaa aaagataccg taattttatt attagattct ccaaatgatt 2462
ttcatcaatt taaaatcatt caatatctga cagttactct tcagttttag gcttaccttg 2522
gtcatgcttc agttgtactt ccagtgcgtc tcttttgttc ctggctttga catgaaaaga 2582
taggtttgag ttcaaatttt gcattgtgtg agcttctaca gattttagac aaggaccgtt 2642
tttactaagt aaaagggtgg agaggttcct ggggtggatt cctaagcagt gcttgtaaac 2702
catcgcgtgc aatgagccag atggagtacc atgagggttg ttatttgttg tttttaacaa 2762
ctaatcaaga gtgagtgaac aactatttat aaactagatc tcctattttt cagaatgctc 2822
ttctacgtat aaatatgaaa tgataaagat gtcaaatatc tcagaggcta tagctgggaa 2882
cccgactgtg aaagtatgtg atatctgaac acatactaga aagctctgca tgtgtgttgt 2942
ccttcagcat aattcggaag ggaaaacagt cgatcaaggg atgtattgga acatgtcgga 3002
gtagaaattg ttcctgatgt gccagaactt cgaccctttc tctgagagag atgatcgtgc 3062
ctataaatag taggaccaat gttgtgatta acatcatcag gcttggaatg aattctctct 3122
aaaaataaaa tgatgtatga tttgttgttg gcatcccctt tattaattca ttaaatttct 3182
ggatttgggt tgtgacccag ggtgcattaa cttaaaagat tcactaaagc agcacatagc 3242
actgggaact ctggctccga aaaactttgt tatatatatc aaggatgttc tggctttaca 3302
ttttatttat tagctgtaaa tacatgtgtg gatgtgtaaa tggagcttgt acatattgga 3362
aaggtcattg tggctatctg catttataaa tgtgtggtgc taactgtatg tgtctttatc 3422
agtgatggtc tcacagagcc aactcactct tatgaaatgg gctttaacaa aacaagaaag 3482
aaacgtactt aactgtgtga agaaatggaa tcagctttta ataaaattga caacatttta 3542
ttaccac 3549


<210> 14 

<211> 475 

<212> PRT 

<213> Homo sapiens


<400> 14 

Met Glu Ser Lys Ala Leu Leu Val Leu Thr Leu Ala Val Trp Leu Gln
1 5 10 15

Ser Leu Thr Ala Ser Arg Gly Gly Val Ala Ala Ala Asp Gln Arg Arg
20 25 30

Asp Phe Ile Asp Ile Glu Ser Lys Phe Ala Leu Arg Thr Pro Glu Asp
35 40 45

Thr Ala Glu Asp Thr Cys His Leu Ile Pro Gly Val Ala Glu Ser Val
50 55 60

Ala Thr Cys His Phe Asn His Ser Ser Lys Thr Phe Met Val Ile His
65 70 75 80

Gly Trp Thr Val Thr Gly Met Tyr Glu Ser Trp Val Pro Lys Leu Val
85 90 95

Ala Ala Leu Tyr Lys Arg Glu Pro Asp Ser Asn Val Ile Val Val Asp
100 105 110

Trp Leu Ser Arg Ala Gln Glu His Tyr Pro Val Ser Ala Gly Tyr Thr
115 120 125

Lys Leu Val Gly Gln Asp Val Ala Arg Phe Ile Asn Trp Met Glu Glu
130 135 140

Glu Phe Asn Tyr Pro Leu Asp Asn Val His Leu Leu Gly Tyr Ser Leu
145 150 155 160

Gly Ala His Ala Ala Gly Ile Ala Gly Ser Leu Thr Asn Lys Lys Val
165 170 175

Asn Arg Ile Thr Gly Leu Asp Pro Ala Gly Pro Asn Phe Glu Tyr Ala
180 185 190

Glu Ala Pro Ser Arg Leu Ser Pro Asp Asp Ala Asp Phe Val Asp Val
195 200 205

Leu His Thr Phe Thr Arg Gly Ser Pro Gly Arg Ser Ile Gly Ile Gln
210 215 220

Lys Pro Val Gly His Val Asp Ile Tyr Pro Asn Gly Gly Thr Phe Gln
225 230 235 240

Pro Gly Cys Asn Ile Gly Glu Ala Ile Arg Val Ile Ala Glu Arg Gly
245 250 255

Leu Gly Asp Val Asp Gln Leu Val Lys Cys Ser His Glu Arg Ser Ile
260 265 270

His Leu Phe Ile Asp Ser Leu Leu Asn Glu Glu Asn Pro Ser Lys Ala
275 280 285

Tyr Arg Cys Ser Ser Lys Glu Ala Phe Glu Lys Gly Leu Cys Leu Ser
290 295 300





Cys 

Arg 

Lys 

Asn 

Arg 

Cys 

Asn 

Asn 

Leu 

Gly 

Tyr 

Glu 

He 

Asn 

Lys 

Val 

305 





310 





315 





320 

Arg 

Ala 

Lys 

Arg 

Ser 

Ser 

Lys 

Met 

Tyr 

Leu 

Lys 

Thr 

Arg 

Ser 

Gin 

Met 





325 





330 





335 


Pro 

Tyr 

Lys 

Val 

Phe 

His 

Tyr 

Gin 

Val 

Lys 

He 

His 

Phe 

Ser 

Gly 

Thr 




340 





345 





350 



Glu 

Ser 

Glu 

Thr 

His 

Thr 

Asn 

Gin 

Ala 

Phe 

Glu 

He 

Ser 

Leu 

Tyr 

Gly 



355 





360 





365 




Thr 

Val 

Ala 

Glu 

Ser 

Glu 

Asn 

He 

Pro 

Phe 

Thr 

Leu 

Pro 

Glu 

Val 

Ser 


370 





375 





380 





Thr 

Asn 

Lys 

Thr 

Tyr 

Ser 

Phe 

Leu 

He 

Tyr 

Thr 

Glu 

Val 

Asp 

He 

Gly 

385 





390 





395 





400 

Glu 

Leu 

Leu 

Met 

Leu 

Lys 

Leu 

Lys 

Trp 

Lys 

Ser 

Asp 

Ser 

Tyr 

Phe 

Ser 





405 





410 





415 


Trp 

Ser 

Asp 

Trp 

Trp 

Ser 

Ser 

Pro 

Gly 

Phe 

Ala 

He 

Gin 

Lys 

He 

Arg 




420 





425 





430 



Val 

Lys 

Ala 

Gly 

Glu 

Thr 

Gin 

Lys 

Lys 

Val 

He 

Phe 

Cys 

Ser 

Arg 

Glu 



435 





440 





445 




Lys 

Val 

Ser 

His 

Leu 

Gin 

Lys 

Gly 

Lys 

Ala 

Pro 

Ala 

Val 

Phe 

Val 

Lys 

450 





455 





460 





Cys 

His 

Asp 

Lys 

Ser 

Leu 

Asn 

Lys 

Lys 

Ser 

Gly 






465 





470 





475 







<210> 15

<211> 1466

<212> DNA

<213> Homo sapien
<220>
<221> CDS

<222> (115) . . . (1305)

<223> Nucleotide sequence encoding apolipoprotein A-IV (APOA4)

<400> 15 

agttcccact gcagcgcagg tgagctctcc tgaggacctc tctgtcagct cccctgattg 60
tagggaggca tccagtgtgg caagaaactc ctccagccca gcaagcagct cagg atg 117

Met
1

ttc ctg aag gcc gtg gtc ctg acc ctg gcc ctg gtg gct gtc gcc gga 165
Phe Leu Lys Ala Val Val Leu Thr Leu Ala Leu Val Ala Val Ala Gly
5 10 15

gcc agg gct gag gtc agt gct gac cag gtg gcc aca gtg atg tgg gac 213
Ala Arg Ala Glu Val Ser Ala Asp Gln Val Ala Thr Val Met Trp Asp
20 25 30

tac ttc agc cag ctg agc aac aat gcc aag gag gcc gtg gaa cat ctc 261
Tyr Phe Ser Gln Leu Ser Asn Asn Ala Lys Glu Ala Val Glu His Leu
35 40 45

cag aaa tct gaa ctc acc cag caa ctc aat gcc ctc ttc cag gac aaa 309
Gln Lys Ser Glu Leu Thr Gln Gln Leu Asn Ala Leu Phe Gln Asp Lys
50 55 60 65

ctt gga gaa gtg aac act tac gca ggt gac ctg cag aag aag ctg gtg 357
Leu Gly Glu Val Asn Thr Tyr Ala Gly Asp Leu Gln Lys Lys Leu Val
70 75 80

ccc ttt gcc acc gag ctg cat gaa cgc ctg gcc aag gac tcg gag aaa 405
Pro Phe Ala Thr Glu Leu His Glu Arg Leu Ala Lys Asp Ser Glu Lys
85 90 95

ctg aag gag gag att ggg aag gag ctg gag gag ctg agg gcc cgg ctg 453
Leu Lys Glu Glu Ile Gly Lys Glu Leu Glu Glu Leu Arg Ala Arg Leu
100 105 110

ctg ccc cat gcc aat gag gtg agc cag aag atc ggg gac aac ctg cga 501
Leu Pro His Ala Asn Glu Val Ser Gln Lys Ile Gly Asp Asn Leu Arg
115 120 125

gag ctt cag cag cgc ctg gag ccc tac gcg gac cag ctg cgc acc cag 549
Glu Leu Gln Gln Arg Leu Glu Pro Tyr Ala Asp Gln Leu Arg Thr Gln
130 135 140 145

gtc aac acg cag gcc gag cag ctg cgg cgc cag ctg acc ccc tac gca 597
Val Asn Thr Gln Ala Glu Gln Leu Arg Arg Gln Leu Thr Pro Tyr Ala
150 155 160

cag cgc atg gag aga gtg ctg cgg gag aac gcc gac agc ctg cag gcc 645
Gln Arg Met Glu Arg Val Leu Arg Glu Asn Ala Asp Ser Leu Gln Ala
165 170 175

tcg ctg agg ccc cac gcc gac gag ctc aag gcc aag atc gac cag aac 693
Ser Leu Arg Pro His Ala Asp Glu Leu Lys Ala Lys Ile Asp Gln Asn
180 185 190

gtg gag gag ctc aag gga cgc ctt acg ccc tac gct gac gaa ttc aaa 741
Val Glu Glu Leu Lys Gly Arg Leu Thr Pro Tyr Ala Asp Glu Phe Lys
195 200 205

gtc aag att gac cag acc gtg gag gag ctg cgc cgc agc ctg gct ccc 789
Val Lys Ile Asp Gln Thr Val Glu Glu Leu Arg Arg Ser Leu Ala Pro
210 215 220 225

tat gct cag gac acg cag gag aag ctc aac cac cag ctt gag ggc ctg 837
Tyr Ala Gln Asp Thr Gln Glu Lys Leu Asn His Gln Leu Glu Gly Leu
230 235 240

acc ttc cag atg aag aag aac gcc gag gag ctc aag gcc agg atc tcg 885
Thr Phe Gln Met Lys Lys Asn Ala Glu Glu Leu Lys Ala Arg Ile Ser
245 250 255

gcc agt gcc gag gag ctg cgg cag agg ctg gcg ccc ttg gcc gag gac 933
Ala Ser Ala Glu Glu Leu Arg Gln Arg Leu Ala Pro Leu Ala Glu Asp
260 265 270

gtg cgt ggc aac ctg agg ggc aac acc gag ggg ctg cag aag tca ctg 981
Val Arg Gly Asn Leu Arg Gly Asn Thr Glu Gly Leu Gln Lys Ser Leu
275 280 285

gca gag ctg ggt ggg cac ctg gac cag cag gtg gag gag ttc cga cgc 1029
Ala Glu Leu Gly Gly His Leu Asp Gln Gln Val Glu Glu Phe Arg Arg
290 295 300 305

cgg gtg gag ccc tac ggg gaa aac ttc aac aaa gcc ctg gtg cag cag 1077
Arg Val Glu Pro Tyr Gly Glu Asn Phe Asn Lys Ala Leu Val Gln Gln
310 315 320

atg gaa cag ctc agg acg aaa ctg ggc ccc cat gcg ggg gac gtg gaa 1125
Met Glu Gln Leu Arg Thr Lys Leu Gly Pro His Ala Gly Asp Val Glu
325 330 335

ggc cac ttg agc ttc ctg gag aag gac ctg agg gac aag gtc aac tcc 1173
Gly His Leu Ser Phe Leu Glu Lys Asp Leu Arg Asp Lys Val Asn Ser
340 345 350

ttc ttc agc acc ttc aag gag aaa gag agc cag gac aag act ctc tcc 1221
Phe Phe Ser Thr Phe Lys Glu Lys Glu Ser Gln Asp Lys Thr Leu Ser
355 360 365

ctc cct gag ctg gag caa cag cag gaa cag cat cag gag cag cag cag 1269
Leu Pro Glu Leu Glu Gln Gln Gln Glu Gln His Gln Glu Gln Gln Gln
370 375 380 385

gag cag gtg cag atg ctg gcc cct ttg gag agc tga gctgcccctg 1315
Glu Gln Val Gln Met Leu Ala Pro Leu Glu Ser *
390 395

gtgcactggc cccaccctcg tggacacctg ccctgccctg ccacctgtct gtctgtccca 1375
aagaagttct ggtatgaact tgaggacaca tgtccagtgg gaggtgagac cacctctcaa 1435
tattcaataa agctgctgag aatctagcct c 1466

<210> 16

<211> 396 

<212> PRT 

<213> Homo sapien 


<400> 16 


Met Phe Leu Lys Ala Val Val Leu Thr Leu Ala Leu Val Ala Val Ala
1 5 10 15

Gly Ala Arg Ala Glu Val Ser Ala Asp Gln Val Ala Thr Val Met Trp
20 25 30

Asp Tyr Phe Ser Gln Leu Ser Asn Asn Ala Lys Glu Ala Val Glu His
35 40 45

Leu Gln Lys Ser Glu Leu Thr Gln Gln Leu Asn Ala Leu Phe Gln Asp
50 55 60

Lys Leu Gly Glu Val Asn Thr Tyr Ala Gly Asp Leu Gln Lys Lys Leu
65 70 75 80

Val Pro Phe Ala Thr Glu Leu His Glu Arg Leu Ala Lys Asp Ser Glu
85 90 95

Lys Leu Lys Glu Glu Ile Gly Lys Glu Leu Glu Glu Leu Arg Ala Arg
100 105 110

Leu Leu Pro His Ala Asn Glu Val Ser Gln Lys Ile Gly Asp Asn Leu
115 120 125

Arg Glu Leu Gln Gln Arg Leu Glu Pro Tyr Ala Asp Gln Leu Arg Thr
130 135 140

Gln Val Asn Thr Gln Ala Glu Gln Leu Arg Arg Gln Leu Thr Pro Tyr
145 150 155 160

Ala Gln Arg Met Glu Arg Val Leu Arg Glu Asn Ala Asp Ser Leu Gln
165 170 175

Ala Ser Leu Arg Pro His Ala Asp Glu Leu Lys Ala Lys Ile Asp Gln
180 185 190

Asn Val Glu Glu Leu Lys Gly Arg Leu Thr Pro Tyr Ala Asp Glu Phe
195 200 205

Lys Val Lys Ile Asp Gln Thr Val Glu Glu Leu Arg Arg Ser Leu Ala
210 215 220

Pro Tyr Ala Gln Asp Thr Gln Glu Lys Leu Asn His Gln Leu Glu Gly
225 230 235 240

Leu Thr Phe Gln Met Lys Lys Asn Ala Glu Glu Leu Lys Ala Arg Ile
245 250 255

Ser Ala Ser Ala Glu Glu Leu Arg Gln Arg Leu Ala Pro Leu Ala Glu
260 265 270

Asp Val Arg Gly Asn Leu Arg Gly Asn Thr Glu Gly Leu Gln Lys Ser
275 280 285

Leu Ala Glu Leu Gly Gly His Leu Asp Gln Gln Val Glu Glu Phe Arg
290 295 300

Arg Arg Val Glu Pro Tyr Gly Glu Asn Phe Asn Lys Ala Leu Val Gln
305 310 315 320

Gln Met Glu Gln Leu Arg Thr Lys Leu Gly Pro His Ala Gly Asp Val
325 330 335

Glu Gly His Leu Ser Phe Leu Glu Lys Asp Leu Arg Asp Lys Val Asn
340 345 350

Ser Phe Phe Ser Thr Phe Lys Glu Lys Glu Ser Gln Asp Lys Thr Leu
355 360 365

Ser Leu Pro Glu Leu Glu Gln Gln Gln Glu Gln His Gln Glu Gln Gln
370 375 380

Gln Glu Gln Val Gln Met Leu Ala Pro Leu Glu Ser
385 390 395







<210> 17
<211> 1156

<212> DNA

<213> Homo sapien

<220>
<221> CDS

<222> (61) . . . (1014)

<223> Nucleotide sequence encoding apolipoprotein E (APOE)

<400> 17 

cgcagcggag gtgaaggacg tccttcccca ggagccgact ggccaatcac aggcaggaag 60
atg aag gtt ctg tgg gct gcg ttg ctg gtc aca ttc ctg gca gga tgc 108
Met Lys Val Leu Trp Ala Ala Leu Leu Val Thr Phe Leu Ala Gly Cys
1 5 10 15

cag gcc aag gtg gag caa gcg gtg gag aca gag ccg gag ccc gag ctg 156
Gln Ala Lys Val Glu Gln Ala Val Glu Thr Glu Pro Glu Pro Glu Leu
20 25 30

cgc cag cag acc gag tgg cag agc ggc cag cgc tgg gaa ctg gca ctg 204
Arg Gln Gln Thr Glu Trp Gln Ser Gly Gln Arg Trp Glu Leu Ala Leu
35 40 45

ggt cgc ttt tgg gat tac ctg cgc tgg gtg cag aca ctg tct gag cag 252
Gly Arg Phe Trp Asp Tyr Leu Arg Trp Val Gln Thr Leu Ser Glu Gln
50 55 60

gtg cag gag gag ctg ctc agc tcc cag gtc acc cag gaa ctg agg gcg 300
Val Gln Glu Glu Leu Leu Ser Ser Gln Val Thr Gln Glu Leu Arg Ala
65 70 75 80

ctg atg gac gag acc atg aag gag ttg aag gcc tac aaa tcg gaa ctg 348
Leu Met Asp Glu Thr Met Lys Glu Leu Lys Ala Tyr Lys Ser Glu Leu
85 90 95

gag gaa caa ctg acc ccg gtg gcg gag gag acg cgg gca cgg ctg tcc 396
Glu Glu Gln Leu Thr Pro Val Ala Glu Glu Thr Arg Ala Arg Leu Ser
100 105 110

aag gag ctg cag gcg gcg cag gcc cgg ctg ggc gcg gac atg gag gac 444
Lys Glu Leu Gln Ala Ala Gln Ala Arg Leu Gly Ala Asp Met Glu Asp
115 120 125

gtg tgc ggc cgc ctg gtg cag tac cgc ggc gag gtg cag gcc atg ctc 492
Val Cys Gly Arg Leu Val Gln Tyr Arg Gly Glu Val Gln Ala Met Leu
130 135 140

ggc cag agc acc gag gag ctg cgg gtg cgc ctc gcc tcc cac ctg cgc 540
Gly Gln Ser Thr Glu Glu Leu Arg Val Arg Leu Ala Ser His Leu Arg
145 150 155 160

aag ctg cgt aag cgg ctc ctc cgc gat gcc gat gac ctg cag aag cgc 588
Lys Leu Arg Lys Arg Leu Leu Arg Asp Ala Asp Asp Leu Gln Lys Arg
165 170 175

ctg gca gtg tac cag gcc ggg gcc cgc gag ggc gcc gag cgc ggc ctc 636
Leu Ala Val Tyr Gln Ala Gly Ala Arg Glu Gly Ala Glu Arg Gly Leu
180 185 190

agc gcc atc cgc gag cgc ctg ggg ccc ctg gtg gaa cag ggc cgc gtg 684
Ser Ala Ile Arg Glu Arg Leu Gly Pro Leu Val Glu Gln Gly Arg Val
195 200 205

cgg gcc gcc act gtg ggc tcc ctg gcc ggc cag ccg cta cag gag cgg 732
Arg Ala Ala Thr Val Gly Ser Leu Ala Gly Gln Pro Leu Gln Glu Arg
210 215 220

gcc cag gcc tgg ggc gag cgg ctg cgc gcg cgg atg gag gag atg ggc 780
Ala Gln Ala Trp Gly Glu Arg Leu Arg Ala Arg Met Glu Glu Met Gly
225 230 235 240

agc cgg acc cgc gac cgc ctg gac gag gtg aag gag cag gtg gcg gag 828
Ser Arg Thr Arg Asp Arg Leu Asp Glu Val Lys Glu Gln Val Ala Glu
245 250 255

gtg cgc gcc aag ctg gag gag cag gcc cag cag ata cgc ctg cag gcc 876
Val Arg Ala Lys Leu Glu Glu Gln Ala Gln Gln Ile Arg Leu Gln Ala
260 265 270

gag gcc ttc cag gcc cgc ctc aag agc tgg ttc gag ccc ctg gtg gaa 924
Glu Ala Phe Gln Ala Arg Leu Lys Ser Trp Phe Glu Pro Leu Val Glu
275 280 285

gac atg cag cgc cag tgg gcc ggg ctg gtg gag aag gtg cag gct gcc 972
Asp Met Gln Arg Gln Trp Ala Gly Leu Val Glu Lys Val Gln Ala Ala
290 295 300

gtg ggc acc agc gcc gcc cct gtg ccc agc gac aat cac tga 1014
Val Gly Thr Ser Ala Ala Pro Val Pro Ser Asp Asn His *
305 310 315

acgccgaagc ctgcagccat gcgaccccac gccaccccgt gcctcctgcc tccgcgcagc 1074

ctgcagcggg agaccctgtc cccgccccag ccgtcctcct ggggtggacc ctagtttaat 1134

aaagattcac caagtttcac gc 1156

<210> 18 

<211> 317 

<212> PRT 

<213> Homo sapien 

<400> 18 

Met Lys Val Leu Trp Ala Ala Leu Leu Val Thr Phe Leu Ala Gly Cys
1 5 10 15

Gln Ala Lys Val Glu Gln Ala Val Glu Thr Glu Pro Glu Pro Glu Leu
20 25 30

Arg Gln Gln Thr Glu Trp Gln Ser Gly Gln Arg Trp Glu Leu Ala Leu
35 40 45

Gly Arg Phe Trp Asp Tyr Leu Arg Trp Val Gln Thr Leu Ser Glu Gln
50 55 60

Val Gln Glu Glu Leu Leu Ser Ser Gln Val Thr Gln Glu Leu Arg Ala
65 70 75 80

Leu Met Asp Glu Thr Met Lys Glu Leu Lys Ala Tyr Lys Ser Glu Leu
85 90 95

Glu Glu Gln Leu Thr Pro Val Ala Glu Glu Thr Arg Ala Arg Leu Ser
100 105 110

Lys Glu Leu Gln Ala Ala Gln Ala Arg Leu Gly Ala Asp Met Glu Asp
115 120 125

Val Cys Gly Arg Leu Val Gln Tyr Arg Gly Glu Val Gln Ala Met Leu
130 135 140

Gly Gln Ser Thr Glu Glu Leu Arg Val Arg Leu Ala Ser His Leu Arg
145 150 155 160

Lys Leu Arg Lys Arg Leu Leu Arg Asp Ala Asp Asp Leu Gln Lys Arg
165 170 175

Leu Ala Val Tyr Gln Ala Gly Ala Arg Glu Gly Ala Glu Arg Gly Leu
180 185 190

Ser Ala Ile Arg Glu Arg Leu Gly Pro Leu Val Glu Gln Gly Arg Val
195 200 205

Arg Ala Ala Thr Val Gly Ser Leu Ala Gly Gln Pro Leu Gln Glu Arg
210 215 220

Ala Gln Ala Trp Gly Glu Arg Leu Arg Ala Arg Met Glu Glu Met Gly
225 230 235 240

Ser Arg Thr Arg Asp Arg Leu Asp Glu Val Lys Glu Gln Val Ala Glu
245 250 255

Val Arg Ala Lys Leu Glu Glu Gln Ala Gln Gln Ile Arg Leu Gln Ala
260 265 270

Glu Ala Phe Gln Ala Arg Leu Lys Ser Trp Phe Glu Pro Leu Val Glu
275 280 285

Asp Met Gln Arg Gln Trp Ala Gly Leu Val Glu Lys Val Gln Ala Ala
290 295 300

Val Gly Thr Ser Ala Ala Pro Val Pro Ser Asp Asn His
305 310 315






<210> 19

<211> 1603

<212> DNA

<213> Homo sapien

<220>
<221> CDS

<222> (58) . . . (1557)

<223> Nucleotide sequence encoding hepatic lipase (LIPC)

<400> 19

ggtctctttg gcttcagaaa ttaccaagaa agcctggacc ccgggtgaaa cggagaa atg 60

Met
1

gac aca agt ccc ctg tgt ttc tcc att ctg ttg gtt tta tgc atc ttt 108
Asp Thr Ser Pro Leu Cys Phe Ser Ile Leu Leu Val Leu Cys Ile Phe
5 10 15

atc caa tca agt gcc ctt gga caa agc ctg aaa cca gag cca ttt gga 156
Ile Gln Ser Ser Ala Leu Gly Gln Ser Leu Lys Pro Glu Pro Phe Gly
20 25 30

aga aga gct caa gct gtt gaa aca aac aaa acg ctg cat gag atg aag 204
Arg Arg Ala Gln Ala Val Glu Thr Asn Lys Thr Leu His Glu Met Lys
35 40 45

acc aga ttc ctg ctc ttt gga gaa acc aat cag ggc tgt cag att cga 252
Thr Arg Phe Leu Leu Phe Gly Glu Thr Asn Gln Gly Cys Gln Ile Arg
50 55 60 65

atc aat cat ccg gac acg tta cag gag tgc ggc ttc aac tcc tcc ctg 300
Ile Asn His Pro Asp Thr Leu Gln Glu Cys Gly Phe Asn Ser Ser Leu
70 75 80

cct ctg gtg atg ata atc cac ggg tgg tcg gtg gac ggc gtg cta gaa 348
Pro Leu Val Met Ile Ile His Gly Trp Ser Val Asp Gly Val Leu Glu
85 90 95

aac tgg atc tgg cag atg gtg gcc gcg ctg aag tct cag ccg gcc cag 396
Asn Trp Ile Trp Gln Met Val Ala Ala Leu Lys Ser Gln Pro Ala Gln
100 105 110

cca gtg aac gtg ggg ctg gtg gac tgg atc acc ctg gcc cac gac cac 444
Pro Val Asn Val Gly Leu Val Asp Trp Ile Thr Leu Ala His Asp His
115 120 125

tac acc atc gcc gtc cgc aac acc cgc ctt gtg ggc aag gag gtc gcg 492
Tyr Thr Ile Ala Val Arg Asn Thr Arg Leu Val Gly Lys Glu Val Ala
130 135 140 145

gct ctt ctc cgg tgg ctg gag gaa tct gtt caa ctc tct cga agc cat 540
Ala Leu Leu Arg Trp Leu Glu Glu Ser Val Gln Leu Ser Arg Ser His
150 155 160

gtt cac cta att ggg tac agc ctg ggt gca cac gtg tca gga ttt gcc 588
Val His Leu Ile Gly Tyr Ser Leu Gly Ala His Val Ser Gly Phe Ala
165 170 175

ggc agt tcc atc ggt gga acg cac aag att ggg aga atc aca ggg ctg 636
Gly Ser Ser Ile Gly Gly Thr His Lys Ile Gly Arg Ile Thr Gly Leu
180 185 190

gat gcc gcg gga cct ttg ttt gag gga agt gcc ccc agc aat cgt ctt 684
Asp Ala Ala Gly Pro Leu Phe Glu Gly Ser Ala Pro Ser Asn Arg Leu
195 200 205

tct cca gat gat gcc aat ttt gtg gat gcc att cat acc ttt acg cgg 732
Ser Pro Asp Asp Ala Asn Phe Val Asp Ala Ile His Thr Phe Thr Arg
210 215 220 225

gag cac atg ggc ctg agc gtg ggc atc aaa cag ccc ata gga cac tat 780
Glu His Met Gly Leu Ser Val Gly Ile Lys Gln Pro Ile Gly His Tyr
230 235 240

gac ttc tat ccc aac ggg ggc tcc ttc cag cct ggc tgc cac ttc cta 828
Asp Phe Tyr Pro Asn Gly Gly Ser Phe Gln Pro Gly Cys His Phe Leu
245 250 255

gag ctc tac aga cat att gcc cag cac ggc ttc aat gcc atc acc cag 876
Glu Leu Tyr Arg His Ile Ala Gln His Gly Phe Asn Ala Ile Thr Gln
260 265 270

acc ata aaa tgc tcc cac gag cga tcg gtg cac ctt ttc atc gac tcc 924
Thr Ile Lys Cys Ser His Glu Arg Ser Val His Leu Phe Ile Asp Ser
275 280 285

ttg ctg cac gcc ggc acg cag agc atg gcc tac ccg tgt ggt gac atg 972
Leu Leu His Ala Gly Thr Gln Ser Met Ala Tyr Pro Cys Gly Asp Met
290 295 300 305

aac agc ttc agc cag ggc ctg tgc ctg agc tgc aag aag ggc cgc tgc 1020
Asn Ser Phe Ser Gln Gly Leu Cys Leu Ser Cys Lys Lys Gly Arg Cys
310 315 320

aac acg ctg ggc tac cac gtc cgc cag gag ccg cgg agc aag agc aag 1068
Asn Thr Leu Gly Tyr His Val Arg Gln Glu Pro Arg Ser Lys Ser Lys
325 330 335

agg ctc ttc ctc gta acg cga gcc cag tcc ccc ttc aaa gtt tat cat 1116
Arg Leu Phe Leu Val Thr Arg Ala Gln Ser Pro Phe Lys Val Tyr His
340 345 350

tac cag tta aag atc cag ttc atc aac caa act gag acg cca ata caa 1164
Tyr Gln Leu Lys Ile Gln Phe Ile Asn Gln Thr Glu Thr Pro Ile Gln
355 360 365

aca act ttt acc atg tca cta ctc gga aca aaa gag aaa atg cag aaa 1212
Thr Thr Phe Thr Met Ser Leu Leu Gly Thr Lys Glu Lys Met Gln Lys
370 375 380 385

att ccc atc act ctg ggc aaa gga att gct agt aat aaa acg tat tcc 1260
Ile Pro Ile Thr Leu Gly Lys Gly Ile Ala Ser Asn Lys Thr Tyr Ser
390 395 400

ttt ctt atc acg ctg gat gtg gat atc ggc gag ctg atc atg atc aag 1308
Phe Leu Ile Thr Leu Asp Val Asp Ile Gly Glu Leu Ile Met Ile Lys
405 410 415

ttc aag tgg gaa aac agt gca gtg tgg gcc aat gtc tgg gac acg gtc 1356
Phe Lys Trp Glu Asn Ser Ala Val Trp Ala Asn Val Trp Asp Thr Val
420 425 430

cag acc atc atc cca tgg agc aca ggg ccg cgc cac tca ggc ctc gtt 1404
Gln Thr Ile Ile Pro Trp Ser Thr Gly Pro Arg His Ser Gly Leu Val
435 440 445

ctg aag acg atc aga gtc aaa gca gga gaa acc cag caa aga atg aca 1452
Leu Lys Thr Ile Arg Val Lys Ala Gly Glu Thr Gln Gln Arg Met Thr
450 455 460 465

ttt tgt tca gaa aac aca gat gac cta cta ctt cgc cca acc cag gaa 1500
Phe Cys Ser Glu Asn Thr Asp Asp Leu Leu Leu Arg Pro Thr Gln Glu
470 475 480

aaa atc ttc gtg aaa tgt gaa ata aag tct aaa aca tca aag cga aag 1548
Lys Ile Phe Val Lys Cys Glu Ile Lys Ser Lys Thr Ser Lys Arg Lys
485 490 495

atc aga tga gatttaatga agacccagtg taaagaataa atgaatctta 1597
Ile Arg *

ctcctt 1603

<210> 20
<211> 499

<212> PRT

<213> Homo sapien 


<400> 20 


Met Asp Thr Ser Pro Leu Cys Phe Ser Ile Leu Leu Val Leu Cys Ile
1 5 10 15

Phe Ile Gln Ser Ser Ala Leu Gly Gln Ser Leu Lys Pro Glu Pro Phe
20 25 30

Gly Arg Arg Ala Gln Ala Val Glu Thr Asn Lys Thr Leu His Glu Met
35 40 45

Lys Thr Arg Phe Leu Leu Phe Gly Glu Thr Asn Gln Gly Cys Gln Ile
50 55 60

Arg Ile Asn His Pro Asp Thr Leu Gln Glu Cys Gly Phe Asn Ser Ser
65 70 75 80

Leu Pro Leu Val Met Ile Ile His Gly Trp Ser Val Asp Gly Val Leu
85 90 95

Glu Asn Trp Ile Trp Gln Met Val Ala Ala Leu Lys Ser Gln Pro Ala
100 105 110

Gln Pro Val Asn Val Gly Leu Val Asp Trp Ile Thr Leu Ala His Asp
115 120 125

His Tyr Thr Ile Ala Val Arg Asn Thr Arg Leu Val Gly Lys Glu Val
130 135 140

Ala Ala Leu Leu Arg Trp Leu Glu Glu Ser Val Gln Leu Ser Arg Ser
145 150 155 160

His Val His Leu Ile Gly Tyr Ser Leu Gly Ala His Val Ser Gly Phe
165 170 175

Ala Gly Ser Ser Ile Gly Gly Thr His Lys Ile Gly Arg Ile Thr Gly
180 185 190

Leu Asp Ala Ala Gly Pro Leu Phe Glu Gly Ser Ala Pro Ser Asn Arg
195 200 205

Leu Ser Pro Asp Asp Ala Asn Phe Val Asp Ala Ile His Thr Phe Thr
210 215 220

Arg Glu His Met Gly Leu Ser Val Gly Ile Lys Gln Pro Ile Gly His
225 230 235 240

Tyr Asp Phe Tyr Pro Asn Gly Gly Ser Phe Gln Pro Gly Cys His Phe
245 250 255

Leu Glu Leu Tyr Arg His Ile Ala Gln His Gly Phe Asn Ala Ile Thr
260 265 270

Gln Thr Ile Lys Cys Ser His Glu Arg Ser Val His Leu Phe Ile Asp
275 280 285

Ser Leu Leu His Ala Gly Thr Gln Ser Met Ala Tyr Pro Cys Gly Asp
290 295 300

Met Asn Ser Phe Ser Gln Gly Leu Cys Leu Ser Cys Lys Lys Gly Arg
305 310 315 320

Cys Asn Thr Leu Gly Tyr His Val Arg Gln Glu Pro Arg Ser Lys Ser
325 330 335

Lys Arg Leu Phe Leu Val Thr Arg Ala Gln Ser Pro Phe Lys Val Tyr
340 345 350

His Tyr Gln Leu Lys Ile Gln Phe Ile Asn Gln Thr Glu Thr Pro Ile
355 360 365

Gln Thr Thr Phe Thr Met Ser Leu Leu Gly Thr Lys Glu Lys Met Gln
370 375 380

Lys Ile Pro Ile Thr Leu Gly Lys Gly Ile Ala Ser Asn Lys Thr Tyr
385 390 395 400

Ser Phe Leu Ile Thr Leu Asp Val Asp Ile Gly Glu Leu Ile Met Ile
405 410 415

Lys Phe Lys Trp Glu Asn Ser Ala Val Trp Ala Asn Val Trp Asp Thr
420 425 430

Val Gln Thr Ile Ile Pro Trp Ser Thr Gly Pro Arg His Ser Gly Leu
435 440 445

Val Leu Lys Thr Ile Arg Val Lys Ala Gly Glu Thr Gln Gln Arg Met
450 455 460

Thr Phe Cys Ser Glu Asn Thr Asp Asp Leu Leu Leu Arg Pro Thr Gln
465 470 475 480

Glu Lys Ile Phe Val Lys Cys Glu Ile Lys Ser Lys Thr Ser Lys Arg
485 490 495

Lys Ile Arg


<210> 21 

<211> 1346 

<212> DNA 

<213> Homo sapien 

<220> 
<221> CDS 

<222> (10) . . . (1077)

<223> Nucleotide sequence encoding paraoxonase 1 (PON1)
<400> 21 

cccccgacc atg gcg aag ctg att gcg ctc acc ctc ttg ggg atg gga ctg 51
Met Ala Lys Leu Ile Ala Leu Thr Leu Leu Gly Met Gly Leu
1 5 10

gca ctc ttc agg aac cac cag tct tct tac caa aca cga ctt aat gct 99
Ala Leu Phe Arg Asn His Gln Ser Ser Tyr Gln Thr Arg Leu Asn Ala
15 20 25 30

ctc cga gag gta caa ccc gta gaa ctt cct aac tgt aat tta gtt aaa 147
Leu Arg Glu Val Gln Pro Val Glu Leu Pro Asn Cys Asn Leu Val Lys
35 40 45

gga atc gaa act ggc tct gaa gac atg gag ata ctg cct aat gga ctg 195
Gly Ile Glu Thr Gly Ser Glu Asp Met Glu Ile Leu Pro Asn Gly Leu
50 55 60

gct ttc att agc tct gga tta aag tat cct gga ata aag agc ttc aac 243
Ala Phe Ile Ser Ser Gly Leu Lys Tyr Pro Gly Ile Lys Ser Phe Asn
65 70 75

ccc aac agt cct gga aaa ata ctt ctg atg gac ctg aat gaa gaa gat 291
Pro Asn Ser Pro Gly Lys Ile Leu Leu Met Asp Leu Asn Glu Glu Asp
80 85 90

cca aca gtg ttg gaa ttg ggg atc act gga agt aaa ttt gat gta tct 339
Pro Thr Val Leu Glu Leu Gly Ile Thr Gly Ser Lys Phe Asp Val Ser
95 100 105 110

tca ttt aac cct cat ggg att agc aca ttc aca gat gaa gat aat gcc 387
Ser Phe Asn Pro His Gly Ile Ser Thr Phe Thr Asp Glu Asp Asn Ala
115 120 125

atg tac ctc ctg gtg gtg aac cat cca gat gcc aag tcc aca gtg gag 435
Met Tyr Leu Leu Val Val Asn His Pro Asp Ala Lys Ser Thr Val Glu
130 135 140

ttg ttt aaa ttt caa gaa gaa gaa aaa tcg ctt ttg cat cta aaa acc 483
Leu Phe Lys Phe Gln Glu Glu Glu Lys Ser Leu Leu His Leu Lys Thr
145 150 155

atc aga cat aaa ctt ctg cct aat ttg aat gat att gtt gct gtg gga 531
Ile Arg His Lys Leu Leu Pro Asn Leu Asn Asp Ile Val Ala Val Gly
160 165 170

cct gag cac ttt tat ggc aca aat gat cac tat ttt ctt gac ccc tac 579
Pro Glu His Phe Tyr Gly Thr Asn Asp His Tyr Phe Leu Asp Pro Tyr
175 180 185 190

tta caa tcc tgg gag atg tat ttg ggt tta gcg tgg tcg tat gtt gtc 627
Leu Gln Ser Trp Glu Met Tyr Leu Gly Leu Ala Trp Ser Tyr Val Val
195 200 205

tac tat agt cca agt gaa gtt cga gtg gtg gca gaa gga ttt gat ttt 675
Tyr Tyr Ser Pro Ser Glu Val Arg Val Val Ala Glu Gly Phe Asp Phe
210 215 220

gct aat gga atc aac att tca ccc gat ggc aag tat gtc tat ata gct 723
Ala Asn Gly Ile Asn Ile Ser Pro Asp Gly Lys Tyr Val Tyr Ile Ala
225 230 235

gag ttg ctg gct cat aag att cat gtg tat gaa aag cat gct aat tgg 771
Glu Leu Leu Ala His Lys Ile His Val Tyr Glu Lys His Ala Asn Trp
240 245 250

act tta act cca ttg aag tcc ctt gac ttt aat acc ctc gtg gat aac 819
Thr Leu Thr Pro Leu Lys Ser Leu Asp Phe Asn Thr Leu Val Asp Asn
255 260 265 270

ata tct gtg gat cct gag aca gga gac ctt tgg gtt gga tgc cat ccc 867
Ile Ser Val Asp Pro Glu Thr Gly Asp Leu Trp Val Gly Cys His Pro
275 280 285

aat ggc atg aaa atc ttc ttc tat gac tca gag aat cct cct gca tca 915
Asn Gly Met Lys Ile Phe Phe Tyr Asp Ser Glu Asn Pro Pro Ala Ser
290 295 300

gag gtg ctt cga atc cag aac att cta aca gaa gaa cct aaa gtg aca 963
Glu Val Leu Arg Ile Gln Asn Ile Leu Thr Glu Glu Pro Lys Val Thr
305 310 315

cag gtt tat gca gaa aat ggc aca gtg ttg caa ggc agt aca gtt gcc 1011
Gln Val Tyr Ala Glu Asn Gly Thr Val Leu Gln Gly Ser Thr Val Ala
320 325 330

tct gtg tac aaa ggg aaa ctg ctg att ggc aca gtg ttt cac aaa gct 1059
Ser Val Tyr Lys Gly Lys Leu Leu Ile Gly Thr Val Phe His Lys Ala
335 340 345 350

ctt tac tgt gag ctc taa cagaccgatt tgcacccatg ccatagaaac 1107
Leu Tyr Cys Glu Leu *
355

tgaggccatt atttcaaccg cttgccatat tccgaggacc cagtgttctt agctgaacaa 1167
tgaatgctga ccctaaatgt ggacatcatg aagcatcaaa gcactgttta actgggagtg 1227
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atatgatgtg tagggctttt ttttgagaat 
aaacctcatt taccataaaa atccttctca 

<210> 22 

<211> 355 

<212> PRT 

<213> Homo sapien 


<400> 22 


Met 

Ala 

Lys 

Leu 

He 

Ala 

Leu 

Thr 

1 




5 




Phe 

Arg 

Asn 

His 

Gin 

Ser 

Ser 

Tyr 




20 





Glu 

Val 

Gin 

Pro 

Val 

Glu 

Leu 

Pro 



35 





40 

Glu 

Thr 

Gly 

Ser 

Glu 

Asp 

Met 

Glu 


50 





55 


He 

Ser 

Ser 

Gly 

Leu 

Lys 

Tyr 

Pro 

65 





70 



Ser 

Pro 

Gly 

Lys 

He 

Leu 

Leu 

Met 





85 




Val 

Leu 

Glu 

Leu 

Gly 

He 

Thr 

Gly 




100 





Asn 

Pro 

His 

Gly 

He 

Ser 

Thr 

Phe 



115 





120 

Leu 

Leu 

Val 

Val 

Asn 

His 

Pro 

Asp 


130 





135 


Lys 

Phe 

Gin 

Glu 

Glu 

Glu 

Lys 

Ser 

145 





150 



His 

Lys 

Leu 

Leu 

Pro 

Asn 

Leu 

Asn 





165 




His 

Phe 

Tyr 

Gly 

Thr 

Asn 

Asp 

His 




180 





Ser 

Trp 

Glu 

Met 

Tyr 

Leu 

Gly 

Leu 



195 





200 

Ser 

Pro 

Ser 

Glu 

Val 

Arg 

Val 

Val 


210 





215 


Gly 

He 

Asn 

He 

Ser 

Pro 

Asp 

Gly 

225 





230 



Leu 

Ala 

His 

Lys 

He 

His 

Val 

Tyr 





245 




Thr 

Pro 

Leu 

Lys 

Ser 

Leu 

Asp 

Phe 




260 





Val 

Asp 

Pro 

Glu 

Thr 

Gly 

Asp 

Leu 



275 





280 

Met 

Lys 

He 

Phe 

Phe 

Tyr 

Asp 

Ser 


290 





295 


Leu 

Arg 

He 

Gin 

Asn 

He 

Leu 

Thr 

305 





310 



Tyr 

Ala 

Glu 

Asn 

Gly 

Thr 

Val 

Leu 





325 




Tyr 

Lys 

Gly 

Lys 

Leu 

Leu 

He 

Gly 


340 


Cys Glu Leu 
355 


acactatcaa atcagtcttg gaatacttga 1287 
ctaaaatgga taaatcagtt aaaaaaaaa 1346 


Leu 

Leu 

Gly 

Met 

Gly 

Leu 

Ala 

Leu 


10 





15 


Gin 

Thr 

Arg 

Leu 

Asn 

Ala 

Leu 

Arg 

25 





30 



Asn 

Cys 

Asn 

Leu 

Val 

Lys 

Gly 

He 





45 




He 

Leu 

Pro 

Asn 

Gly 

Leu 

Ala 

Phe 




60 





Gly 

He 

Lys 

Ser 

Phe 

Asn 

Pro 

Asn 



75 





80 

Asp 

Leu 

Asn 

Glu 

Glu 

Asp 

Pro 

Thr 


90 





95 


Ser 

Lys 

Phe 

Asp 

Val 

Ser 

Ser 

Phe 

105 





110 



Thr 

Asp 

Glu 

Asp 

Asn 

Ala 

Met 

Tyr 





125 




Ala 

Lys 

Ser 

Thr 

Val 

Glu 

Leu 

Phe 




140 





Leu 

Leu 

His 

Leu 

Lys 

Thr 

He 

Arg 



155 





160 

Asp 

He 

Val 

Ala 

Val 

Gly 

Pro 

Glu 


170 





175 


Tyr 

Phe 

Leu 

Asp 

Pro 

Tyr 

Leu 

Gin 

185 





190 



Ala 

Trp 

Ser 

Tyr 

Val 

Val 

Tyr 

Tyr 





205 




Ala 

Glu 

Gly 

Phe 

Asp 

Phe 

Ala 

Asn 




220 





Lys 

Tyr 

Val 

Tyr 

He 

Ala 

Glu 

Leu 



235 





240 

Glu 

Lys 

His 

Ala 

Asn 

Trp 

Thr 

Leu 


250 





255 


Asn 

Thr 

Leu 

Val 

Asp 

Asn 

Ile

Ser 

265 





270 



Trp 

Val 

Gly 

Cys 

His 

Pro 

Asn 

Gly 





285 




Glu 

Asn 

Pro 

Pro 

Ala 

Ser 

Glu 

Val 




300 





Glu 

Glu 

Pro 

Lys 

Val 

Thr 

Gln

Val 



315 





320 

Gln

Gly 

Ser 

Thr 

Val 

Ala 

Ser 

Val 


330 





335 


Thr 

Val 

Phe 

His 

Lys 

Ala 

Leu 

Tyr 

345 





350 




<210> 23 
<211> 1570 
<212> DNA

<213> Homo sapien 

<220> 

<221> CDS 

<222> (1)...(1097)

<223> Nucleotide sequence encoding paraoxonase 2 (PON2) 
<400> 23 

cgg agc gag gca gcg cgc ccg gct ccc gcg cca tgg ggc ggc tgg tgg 48
Arg Ser Glu Ala Ala Arg Pro Ala Pro Ala Pro Trp Gly Gly Trp Trp
1 5 10 15

ctg tgg gct tgc tgg gga tcg cgc tgg cgc tcc tgg gcg aga ggc ttc 96
Leu Trp Ala Cys Trp Gly Ser Arg Trp Arg Ser Trp Ala Arg Gly Phe
20 25 30

tgg cac tca gaa atc gac tta aag cct cca gag aag tag aat ctg tag 144
Trp His Ser Glu Ile Asp Leu Lys Pro Pro Glu Lys * Asn Leu *
35 40 45

acc ttc cac act gcc acc tga tta aag gaa ttg aag ctg gct ctg aag 192
Thr Phe His Thr Ala Thr * Leu Lys Glu Leu Lys Leu Ala Leu Lys
50 55 60

ata ttg aca tac ttc cca atg gtc tgg ctt ttt tta gtg tgg gtc taa 240
Ile Leu Thr Tyr Phe Pro Met Val Trp Leu Phe Leu Val Trp Val *
65 70 75

aat tcc cag gac tcc aca gct ttg cac cag ata agc ctg gag gaa tac 288
Asn Ser Gln Asp Ser Thr Ala Leu His Gln Ile Ser Leu Glu Glu Tyr
80 85 90

taa tga tgg atc taa aag aag aaa aac caa ggg cac ggg aat taa gaa 336
* * Trp Ile * Lys Lys Lys Asn Gln Gly His Gly Asn * Glu
95 100

tca gtc gtg ggt ttg att tgg cct cat tca atc cac atg gca tca gca 384
Ser Val Val Gly Leu Ile Trp Pro His Ser Ile His Met Ala Ser Ala
105 110 115 120

ctt tca tag aca acg atg aca cag ttt atc tct ttg ttg taa acc acc 432
Leu Ser * Thr Thr Met Thr Gln Phe Ile Ser Leu Leu * Thr Thr
125 130

cag aat tca aga ata cag tgg aaa ttt tta aat ttg aag aag cag aaa 480
Gln Asn Ser Arg Ile Gln Trp Lys Phe Leu Asn Leu Lys Lys Gln Lys
135 140 145 150

att ctc tgt tgc atc tga aaa cag tca aac atg agc ttc ttc caa gtg 528
Ile Leu Cys Cys Ile * Lys Gln Ser Asn Met Ser Phe Phe Gln Val
155 160 165

tga atg aca tca cag ctg ttg gac cgg cac att tct atg cca caa atg 576
* Met Thr Ser Gln Leu Leu Asp Arg His Ile Ser Met Pro Gln Met
170 175 180

acc act act tct ctg atc ctt tct taa agt att tag aaa cat act tga 624
Thr Thr Thr Ser Leu Ile Leu Ser * Ser Ile * Lys His Thr *
185 190

act tac act ggg caa atg ttg ttt act aca gtc caa atg aag tta aag 672
Thr Tyr Thr Gly Gln Met Leu Phe Thr Thr Val Gln Met Lys Leu Lys
195 200 205

tgg tag cag aag gat ttg att cag caa atg gga tca ata ttt cac ctg 720
Trp * Gln Lys Asp Leu Ile Gln Gln Met Gly Ser Ile Phe His Leu
210 215 220

atg ata agt ata tct atg ttg ctg aca tat tgg ctc atg aaa ttc atg 768
Met Ile Ser Ile Ser Met Leu Leu Thr Tyr Trp Leu Met Lys Phe Met
225 230 235 240

ttt tgg aaa aac aca cta ata tga att taa ctc agt tga agg tac ttg 816
Phe Trp Lys Asn Thr Leu Ile * Ile * Leu Ser * Arg Tyr Leu
245 250

agc tgg ata cac tgg tgg ata att tat cta ttg atc ctt cct cgg ggg 864
Ser Trp Ile His Trp Trp Ile Ile Tyr Leu Leu Ile Leu Pro Arg Gly
255 260 265

aca tct ggg tag gct gtc atc cta atg gcc aga agc tct tcg tgt atg 912
Thr Ser Gly * Ala Val Ile Leu Met Ala Arg Ser Ser Ser Cys Met
270 275 280

acc cga aca atc ctc cct cgt cag agg ttc tcc gca tcc aga aca ttc 960
Thr Arg Thr Ile Leu Pro Arg Gln Arg Phe Ser Ala Ser Arg Thr Phe
285 290 295 300

tat ctg aga agc cta cag tga cta cag ttt atg cca aca atg ggt ctg 1008
Tyr Leu Arg Ser Leu Gln * Leu Gln Phe Met Pro Thr Met Gly Leu
305 310 315

ttc tcc aag gaa gtt ctg tag cct cag tgt atg atg gga agc tgc tca 1056
Phe Ser Lys Glu Val Leu * Pro Gln Cys Met Met Gly Ser Cys Ser
320 325 330

tag gca ctt tat acc aca gag cct tgt att gtg aac tct aa attgtacttt 1107
* Ala Leu Tyr Thr Thr Glu Pro Cys Ile Val Asn Ser
335 340

tggcatgaaa gtgcgataac ttaacaatta attttctatg aattgctaat tctgagggaa 1167
tttaaccagc aacattgacc cagaaatgta tggcatgtgt agttaatttt attccagtaa 1227
ggaacggccc ttttagttct tagagcactt ttaacaaaaa aggaaaatga acaggttctt 1287
taaaatgcca agcaagggac agaaaagaaa gctgctttcg aataaagtga atacattttg 1347
cacaaagtaa gcctcacctt tgccttccaa ctgccagaac atggattcca ctgaaataga 1407
gtgaattata tttccttaaa atgtgagtga cctcacttct ggcactgtga ctactatggc 1467
tgtttagaac tactgataac gtattttgat gttttgtact tacatctttg tttaccatta 1527
aaaagttgga gttatattaa agactaacta aaatcccagt ttt 1570


<210> 24 

<211> 342 

<212> PRT 

<213> Homo sapien 


<400> 24 
Arg Ser Glu Ala Ala Arg Pro Ala Pro Ala Pro Trp Gly Gly Trp Trp
1 5 10 15

Leu Trp Ala Cys Trp Gly Ser Arg Trp Arg Ser Trp Ala Arg Gly Phe
20 25 30

Trp His Ser Glu Ile Asp Leu Lys Pro Pro Glu Lys Asn Leu Thr Phe
35 40 45

His Thr Ala Thr Leu Lys Glu Leu Lys Leu Ala Leu Lys Ile Leu Thr
50 55 60

Tyr Phe Pro Met Val Trp Leu Phe Leu Val Trp Val Asn Ser Gln Asp
65 70 75 80

Ser Thr Ala Leu His Gln Ile Ser Leu Glu Glu Tyr Trp Ile Lys Lys
85 90 95

Lys Asn Gln Gly His Gly Asn Glu Ser Val Val Gly Leu Ile Trp Pro
100 105 110

His Ser Ile His Met Ala Ser Ala Leu Ser Thr Thr Met Thr Gln Phe
115 120 125

Ile Ser Leu Leu Thr Thr Gln Asn Ser Arg Ile Gln Trp Lys Phe Leu
130 135 140

Asn Leu Lys Lys Gln Lys Ile Leu Cys Cys Ile Lys Gln Ser Asn Met
145 150 155 160

Ser Phe Phe Gln Val Met Thr Ser Gln Leu Leu Asp Arg His Ile Ser
165 170 175

Met Pro Gln Met Thr Thr Thr Ser Leu Ile Leu Ser Ser Ile Lys His
180 185 190

Thr Thr Tyr Thr Gly Gln Met Leu Phe Thr Thr Val Gln Met Lys Leu
195 200 205

Lys Trp Gln Lys Asp Leu Ile Gln Gln Met Gly Ser Ile Phe His Leu
210 215 220

Met Ile Ser Ile Ser Met Leu Leu Thr Tyr Trp Leu Met Lys Phe Met
225 230 235 240

Phe Trp Lys Asn Thr Leu Ile Ile Leu Ser Arg Tyr Leu Ser Trp Ile
245 250 255

His Trp Trp Ile Ile Tyr Leu Leu Ile Leu Pro Arg Gly Thr Ser Gly
260 265 270

Ala Val Ile Leu Met Ala Arg Ser Ser Ser Cys Met Thr Arg Thr Ile
275 280 285

Leu Pro Arg Gln Arg Phe Ser Ala Ser Arg Thr Phe Tyr Leu Arg Ser
290 295 300

Leu Gln Leu Gln Phe Met Pro Thr Met Gly Leu Phe Ser Lys Glu Val
305 310 315 320

Leu Pro Gln Cys Met Met Gly Ser Cys Ser Ala Leu Tyr Thr Thr Glu
325 330 335

Pro Cys Ile Val Asn Ser
340

<210> 25 

<211> 533 

<212> DNA 

<213> Homo sapien 

<220> 

<221> CDS 

<222> (47)...(346)

<223> Nucleotide sequence encoding apolipoprotein 
C-III (APOC3)


<400> 25 

tgctcagttc atccctagag gcagctgctc caggaacaga ggtgcc atg cag ccc 55
Met Gln Pro
1

cgg gta ctc ctt gtt gtt gcc ctc ctg gcg ctc ctg gcc tct gcc cga 103
Arg Val Leu Leu Val Val Ala Leu Leu Ala Leu Leu Ala Ser Ala Arg
5 10 15

gct tca gag gcc gag gat gcc tcc ctt ctc agc ttc atg cag ggt tac 151
Ala Ser Glu Ala Glu Asp Ala Ser Leu Leu Ser Phe Met Gln Gly Tyr
20 25 30 35

atg aag cac gcc acc aag acc gcc aag gat gca ctg agc agc gtg cag 199
Met Lys His Ala Thr Lys Thr Ala Lys Asp Ala Leu Ser Ser Val Gln
40 45 50

gag tcc cag gtg gcc cag cag gcc agg ggc tgg gtg acc gat ggc ttc 247
Glu Ser Gln Val Ala Gln Gln Ala Arg Gly Trp Val Thr Asp Gly Phe
55 60 65

agt tcc ctg aaa gac tac tgg agc acc gtt aag gac aag ttc tct gag 295
Ser Ser Leu Lys Asp Tyr Trp Ser Thr Val Lys Asp Lys Phe Ser Glu
70 75 80

ttc tgg gat ttg gac cct gag gtc aga cca act tca gcc gtg gct gcc 343
Phe Trp Asp Leu Asp Pro Glu Val Arg Pro Thr Ser Ala Val Ala Ala
85 90 95

tga gacctcaata ccccaagtcc acctgcctat ccatcctgcg agctccttgg 396

gtcctgcaat ctccagggct gcccctgtag gttgcttaaa agggacagta ttctcagtgc 456
tctcctaccc cacctcatgc ctggcccccc tccaggcatg ctggcctccc aataaagctg 516
gacaagaagc tgctatg 533

<210> 26 
<211> 99 
<212> PRT 
<213> Homo sapien 

<400> 26 

Met Gln Pro Arg Val Leu Leu Val Val Ala Leu Leu Ala Leu Leu Ala
1 5 10 15

Ser Ala Arg Ala Ser Glu Ala Glu Asp Ala Ser Leu Leu Ser Phe Met
20 25 30

Gln Gly Tyr Met Lys His Ala Thr Lys Thr Ala Lys Asp Ala Leu Ser
35 40 45

Ser Val Gln Glu Ser Gln Val Ala Gln Gln Ala Arg Gly Trp Val Thr
50 55 60

Asp Gly Phe Ser Ser Leu Lys Asp Tyr Trp Ser Thr Val Lys Asp Lys
65 70 75 80

Phe Ser Glu Phe Trp Asp Leu Asp Pro Glu Val Arg Pro Thr Ser Ala
85 90 95

Val Ala Ala


<210> 27 
<211> 8925 
<212> DNA

<213> Homo sapien 

<220> 
<221> CDS 

<222> (5020)...(6162)

<223> Nucleotide encoding ATP-binding cassette (ABC1)
<223> n= a or g or c or t 
<400> 27 

ctcagtgtca gctgctgctg gaagtggcct ggcctctatt tatcttcctg atcctgatct 60 

ctgttcggct gagctaccca ccctatgaac aacatgaatg ccattttcca aataaagcca 120 

tgccctctgc aggaacactt ccttgggttc aggggattat ctgtaatgcc aacaacccct 180 

gtttccgtta cccgactcct ggggaggctc ccggagttgt tggaaacttt aacaaatcca 240 

ttgtggctcg cctgttctca gatgctcgga ggcttctttt atacagccag aaagacacca 300 

gcatgaagga catgcgcaaa gttctgagaa cattacagca gatcaagaaa tccagctcaa 360 

acttgaagct tcaagatttc ctggtggaca atgaaacctt ctctgggttc ctgtatcaca 420 

acctctctct cccaaagtct actgtggaca agatgctgag ggctgatgtc attctccaca 480 

aggtattttt gcaaggctac cagttacatt tgacaagtct gtgcaatgga tcaaaatcag 540 

aagagatgat tcaacttggt gaccaagaag tttctgagct ttgtggccta ccaagggaga 600 

aactggctgc agcagagcga gtacttcgtt ccaacatgga catcctgaag ccaatcctga 660 

gaacactaaa ctctacatct cccttcccga gcaaggagct ggctgaagcc acaaaaacat 720 

tgctgcatag tcttgggact ctggcccagg agctgttcag catgagaagc tggagtgaca 780 

tgcgacagga ggtgatgttt ctgaccaatg tgaacagctc cagctcctcc acccaaatct 840 

accaggctgt gtctcgtatt gtctgcgggc atcccgaggg aggggggctg aagatcaagt 900 

ctctcaactg gtatgaggac aacaactaca aagccctctt tggaggcaat ggcactgagg 960 

aagatgctga aaccttctat gacaactcta caactcctta ctgcaatgat ttgatgaaga 1020 

atttggagtc tagtcctctt tcccgcatta tctggaaagc tctgaagccg ctgctcgttg 1080 

ggaagatcct gtatacacct gacactccag ccacaaggca ggtcatggct gaggtgaaca 1140 

agaccttcca ggaactggct gtgttccatg atctggaagg catgtgggag gaactcagcc 1200 

ccaagatctg gaccttcatg gagaacagcc aagaaatgga ccttgtccgg atgctgttgg 1260 

acagcaggga caatgaccac ttttgggaac agcagttgga tggcttagat tggacagccc 1320 

aagacatcgt ggcgtttttg gccaagcacc cagaggatgt ccagtccagt aatggttctg 1380 

tgtacacctg gagagaagct ttcaacgaga ctaaccaggc aatccggacc atatctcgct 1440 

tcatggagtg tgtcaacctg aacaagctag aacccatagc aacagaagtc tggctcatca 1500 

acaagtccat ggagctgctg gatgagagga agttctgggc tggtattgtg ttcactggaa 1560 

ttactccagg cagcattgag ctgccccatc atgtcaagta caagatccga atggacattg 1620 

acaatgtgga gaggacaaat aaaatcaagg atgggtactg ggaccctggt cctcgagctg 1680 

acccctttga ggacatgcgg tacgtctggg ggggcttcgc ctacttgcag gatgtggtgg 1740 

agcaggcaat catcagggtg ctgacgggca ccgagaagaa aactggtgtc tatatgcaac 1800 

agatgcccta tccctgttac gttgatgaca tctttctgcg ggtgatgagc cggtcaatgc 1860 

ccctcttcat gacgctggcc tggatttact cagtggctgt gatcatcaag ggcatcgtgt 1920 

atgagaagga ggcacggctg aaagagacca tgcggatcat gggcctggac aacagcatcc 1980 

tctggtttag ctggttcatt agtagcctca ttcctcttct tgtgagcgct ggcctgctag 2040 

tggtcatcct gaagttagga aacctgctgc cctacagtga tcccagcgtg gtgtttgtct 2100 

tcctgtccgt gtttgctgtg gtgacaatcc tgcagtgctt cctgattagc acactcttct 2160 

ccagagccaa cctggcagca gcctgtgggg gcatcatcta cttcacgctg tacctgccct 2220 

acgtcctgtg tgtggcatgg caggactacg tgggcttcac actcaagatc ttcgctagcc 2280 

tgctgtctcc tgtggctttt gggtttggct gtgagtactt tgcccttttt gaggagcagg 2340 

gcattggagt gcagtgggac aacctgtttg agagtcctgt ggaggaagat ggcttcaatc 2400 

tcaccacttc ggtctccatg atgctgtttg acaccttcct ctatggggtg atgacctggt 2460 

acattgaggc tgtctttcca ggccagtacg gaattcccag gccctggtat tttccttgca 2520 

ccaagtccta ctggtttggc gaggaaagtg atgagaagag ccaccctggt tccaaccaga 2580 

agagaatatc agaaatctgc atggaggagg aacccaccca cttgaagctg ggcgtgtcca 2640 

ttcagaacct ggtaaaagtc taccgagatg ggatgaaggt ggctgtcgat ggcctggcac 2700 

tgaattttta tgagggccag atcacctcct tcctgggcca caatggagcg gggaagacga 2760 

ccaccatgtc aatcctgacc gggttgttcc ccccgacctc gggcaccgcc tacatcctgg 2820 
gaaaagacat tcgctctgag atgagcacca tccggcagaa cctgggggtc tgtccccagc 2880
ataacgtgct gtttgacatg ctgactgtcg aagaacacat ctggttctat gcccgcttga 2940
aagggctctc tgagaagcac gtgaaggcgg agatggagca gatggccctg gatgttggtt 3000
tgccatcaag caagctgaaa agcaaaacaa gccagctgtc aggtggaatg cagagaaagc 3060
tatctgtggc cttggccttt gtcgggggat ctaaggttgt cattctggat gaacccacag 3120
ctggtgtgga cccttactcc cgcaggggaa tatgggagct gctgctgaaa taccgacaag 3180
gccgcaccat tattctctct acacaccaca tggatgaagc ggacgtcctg ggggacagga 3240
ttgccatcat ctcccatggg aagctgtgct gtgtgggctc ctccctgttt ctgaagaacc 3300
agctgggaac aggctactac ctgaccttgg tcaagaaaga tgtggaatcc tccctcagtt 3360
cctgcagaaa cagtagtagc actgtgtcat acctgaaaaa ggaggacagt gtttctcaga 3420
gcagttctga tgctggcctg ggcagcgacc atgagagtga cacgctgacc atcgatgtct 3480
ctgctatctc caacctcatc aggaagcatg tgtctgaagc ccggctggtg gaagacatag 3540
ggcatgagct gacctatgtg ctgccatatg aagctgctaa ggagggagcc tttgtggaac 3600
tctttcatga gattgatgac cggctctcag acctgggcat ttctagttat ggcatctcag 3660
agacgaccct ggaagaaata ttcctcaagg tggccgaaga gagtggggtg gatgctgaga 3720
cctcagatgg taccttgcca gcaagacgaa acaggcgggc cttcggggac aagcagagct 3780
gtcttcgccc gttcactgaa gatgatgctg ctgatccaaa tgattctgac atagacccag 3840
aatccagaga gacagacttg ctcagtggga tggatggcaa agggtcctac caggtgaaag 3900
gctggaaact tacacagcaa cagtttgtgg cccttttgtg gaagagactg ctaattgcca 3960
gacggagtcg gaaaggattt tttgctcaga ttgtcttgcc agctgtgttt gtctgcattg 4020
cccttgtgtt cagcctgatc gtgccaccct ttggcaagta ccccagcctg gaacttcagc 4080
cctggatgta caacgaacag tacacatttg tcagcaatga tgctcctgag gacacgggaa 4140
ccctggaact cttaaacgcc ctcaccaaag accctggctt cgggacccgc tgtatggaag 4200
gaaacccaat cccagacacg ccctgccagg caggggagga agagtggacc actgccccag 4260
ttccccagac catcatggac ctcttccaga atgggaactg gacaatgcag aacccttcac 4320
ctgcatgcca gtgtagcagc gacaaaatca agaagatgct gcctgtgtgt cccccagggg 4380
caggggggct gcctcctcca caaagaaaac aaaacactgc agatatcctt caggacctga 4440
caggaagaaa catttcggat tatctggtga agacgtatgt gcagatcata gccaaaagct 4500
taaagaacaa gatctgggtg aatgagttta ggtatggcgg cttttccctg ggtgtcagta 4560
atactcaagc acttcctccg agtcaagaag ttaatgatgc catcaaacaa atgaagaaac 4620
acctaaagct ggccaaggac agttctgcag atcgatttct caacagcttg ggaagattta 4680
tgacaggact ggacaccaaa aataatgtca aggtgtggtt caataacaag ggctggcatg 4740
caatcagctc tttcctgaat gtcatcaaca atgccattct ccgggccaac ctgcaaaagg 4800
gagagaaccc tagccattat ggaattactg ctttcaatca tcccctgaat ctcaccaagc 4860
agcagctctc agaggtggct cggatgacca catcagtgga tgtccttgtg tccatctgtg 4920
tcatctttgc aatgtccttc gtcccagcca gctttgtcgt attcctgatc caggagcggg 4980
tcagcaaagc aaaacacctg cagttcatca gtggagtga agc ctg tca tct act 5034
Ser Leu Ser Ser Thr
1 5


ggc tct cta att ttg tct ggg ata tgt gca att aag ttg ttt cca ann 5082
Gly Ser Leu Ile Leu Ser Gly Ile Cys Ala Ile Lys Leu Phe Pro Xaa
10 15 20

nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn 5130
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
25 30 35

nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn nnn 5178
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
40 45 50

nta atc ttt cct ttt cag tgc ttt ggg ctc ctg gga gtt aat ggg gct 5226
Xaa Ile Phe Pro Phe Gln Cys Phe Gly Leu Leu Gly Val Asn Gly Ala
55 60 65

gga aaa tca tca act ttc aag atg tta aca gga gat acc act gtt acc 5274
Gly Lys Ser Ser Thr Phe Lys Met Leu Thr Gly Asp Thr Thr Val Thr
70 75 80 85 

99^ 9at gct ttc ctt aac att tgc agt atc tta tca aac atc cat 5322
Arg Gly Asp Ala Phe Leu Asn Ile Cys Ser Ile Leu Ser Asn Ile His
90 95 100

gaa gta cat cag aac atg ggc tac tgc cct cag ttt gat gcc atc aca 5370
Glu Val His Gln Asn Met Gly Tyr Cys Pro Gln Phe Asp Ala Ile Thr
105 110 115

gag ctg ttg act ggg aga gaa cac gtg gag ttc ttt gcc ctt ttg aga 5418
Glu Leu Leu Thr Gly Arg Glu His Val Glu Phe Phe Ala Leu Leu Arg
120 125 130

gga gtc cca gag aaa gaa gtt ggc aag gtt ggt gag tgg gcg att cgg 5466
Gly Val Pro Glu Lys Glu Val Gly Lys Val Gly Glu Trp Ala Ile Arg
135 140 145

aaa ctg ggc ctc gtg aag tat gga gaa aaa tat gct ggt aac tat agt 5514
Lys Leu Gly Leu Val Lys Tyr Gly Glu Lys Tyr Ala Gly Asn Tyr Ser
150 155 160 165

gga ggc aac aaa cgc aag ctc tct aca gcc atg gct ttg atc ggc ggg 5562
Gly Gly Asn Lys Arg Lys Leu Ser Thr Ala Met Ala Leu Ile Gly Gly
170 175 180

cct cct gtg gtg ttt ctg gat gaa ccc acc aca ggc atg gat ccc aaa 5610
Pro Pro Val Val Phe Leu Asp Glu Pro Thr Thr Gly Met Asp Pro Lys
185 190 195

gcc cgg cgg ttc ttg tgg aat tgt gcc cta agt gtt gtc aag gag ggg 5658
Ala Arg Arg Phe Leu Trp Asn Cys Ala Leu Ser Val Val Lys Glu Gly
200 205 210

aga tca gta gtg ctt aca tct cat agt atg gaa gaa tgt gaa gct ctt 5706
Arg Ser Val Val Leu Thr Ser His Ser Met Glu Glu Cys Glu Ala Leu
215 220 225

tgc act agg atg gca atc atg gtc aat gga agg ttc agg tgc ctt ggc 5754
Cys Thr Arg Met Ala Ile Met Val Asn Gly Arg Phe Arg Cys Leu Gly
230 235 240 245

^9t gtc cag cat cta aaa aat agg ttt gga gat ggt tat aca ata gtt 5802
Ser Val Gln His Leu Lys Asn Arg Phe Gly Asp Gly Tyr Thr Ile Val
250 255 260

gta cga ata gca ggg tcc aac ccg gac ctg aag cct gtc cag gat ttc 5850
Val Arg Ile Ala Gly Ser Asn Pro Asp Leu Lys Pro Val Gln Asp Phe
265 270 275

ttt gga ctt gca ttt cct gga agt gtt cta aaa gag aaa cac cgg aac 5898
Phe Gly Leu Ala Phe Pro Gly Ser Val Leu Lys Glu Lys His Arg Asn
280 285 290

atg cta caa tac cag ctt cca tct tca tta tct tct ctg gcc agg ata 5946
Met Leu Gln Tyr Gln Leu Pro Ser Ser Leu Ser Ser Leu Ala Arg Ile
295 300 305

ttc agc atc ctc tcc cag agc aaa aag cga ctc cac ata gaa gac tac 5994
Phe Ser Ile Leu Ser Gln Ser Lys Lys Arg Leu His Ile Glu Asp Tyr
310 315 320 325

tct gtt tct cag aca aca ctt gac caa gta ttt gtg aac ttt gcc aag 6042
Ser Val Ser Gln Thr Thr Leu Asp Gln Val Phe Val Asn Phe Ala Lys
330 335 340

gac caa agt gat gat gac cac tta aaa gac ctc tca tta cac aaa aac 6090
Asp Gln Ser Asp Asp Asp His Leu Lys Asp Leu Ser Leu His Lys Asn
345 350 355

cag aca gta gtg gac gtt gca gtt ctc aca tct ttt cta cag gat gag 6138
Gln Thr Val Val Asp Val Ala Val Leu Thr Ser Phe Leu Gln Asp Glu
360 365 370

aaa gtg aaa gaa agc tat gta tga agaatcctgt tcatacgggg tggctgaaag 6192
Lys Val Lys Glu Ser Tyr Val *
375 380


taaagaggaa ctagactttc ctttgcacca tgtgaagtgt tgtggagaaa agagccagaa 6252
gttgatgtgg gaagaagtaa actggatact gtactgatac tattcaatgc aatgcaattc 6312
aatgcaatga aaacaaaatt ccattacagg ggcagtgcct ttgtagccta tgtcttgtat 6372
ggctctcaag tgaaagactt gaatttagtt ttttacctat acctatgtga aactctatta 6432
tggaacccaa tggacatatg ggtttgaact cacacttttt tttttttttt tgttcctgtg 6492
tattctcatt ggggttgcaa caataattca tpaagtaatc atggccagcg attattgatc 6552
aaaatcaaaa ggtaatgcac atcctcattc actaagccat gccatgccca ggagactggt 6612
ttcccggtga cacatccatt gctggcaatg agtgtgccag agttattagt gccaagtttt 6672
tcagaaagtt tgaagcacca tggtgtgtca tgctcacttt tgtgaaagct gctctgctca 6732
gagtctatca acattgaata tcagttgaca gaatggtgcc atgcgtggct aacatcctgc 6792
tttgattccc tctgataagc tgttctggtg gcagtaacat gcaacaaaaa tgtgggtgtc 6852
tccaggcacg ggaaacttgg ttccattgtt atattgtcct atgcttcgag ccatgggtct 6912
acagggtcat ccttatgaga ctcttaaata tacttagatc ctggtaagag gcaaagaatc 6972
aacagccaaa ctgctggggc tgcaagctgc tgaagccagg gcatgggatt aaagagattg 7032
tgcgttcaaa cctagggaag cctgtgccca tttgtcctga ctgtctgcta acatggtaca 7092
ctgcatctca agatgtttat ctgacacaag tgtattattt ctggcttttt gaattaatct 7152
agaaaatgaa aagatggagt tgtattttga caaaaatgtt tgtacttttt aatgttattt 7212
ggaattttaa gttctatcag tgacttctga atccttagaa tggcctcttt gtagaaccct 7272
gtggtataga ggagtatggc cactgcccca ctatttttat tttcttatgt aagtttgcat 7332
atcagtcatg actagtgcct agaaagcaat gtgatggtca ggatctcatg acattatatt 7392
tgagtttctt tcagatcatt taggatactc ttaatctcac ttcatcaatc aaatattttt 7452
tgagtgtatg ctgtagctga aagagtatgt acgtacgtat aagactagag agatattaag 7512
tctcagtaca cttcctgtgc catgttattc agctcactgg tttacaaata taggttgtct 7572
tgtggttgta ggagcccact gtaacaatac tgggcagcct tttttttttt ttttttaatt 7632
gcaacaatgc aaaagccaag aaagtataag ggtcacaagt ctaaacaatg aattcttcaa 7692
cagggaaaac agctagcttg aaaacttgct gaaaaacaca acttgtgttt atggcattta 7752
gtaccttcaa ataattggct ttgcagatat tggatacccc attaaatctg acagtctcaa 7812
atttttcatc tcttcaatca ctagtcaaga aaaatataaa aacaacaaat acttccatat 7872
ggagcatttt tcagagtttt ctaacccagt cttatttttc tagtcagtaa acatttgtaa 7932
aaatactgtt tcactaatac ttactgttaa ctgtcttgag agaaaagaaa aatatgagag 7992
aactattgtt tggggaagtt caagtgatct ttcaatatca ttactaactt cttccacttt 8052
ttccagaatt tgaatattaa cgctaaaggt gtaagacttc agatttcaaa ttaatctttc 8112
tatatttttt aaatttacag aatattatat aacccactgc tgaaaaagaa aaaaatgatt 8172
gttttagaag ttaaagtcaa tattgatttt aaatataagt aatgaaggca tatttccaat 8232
aactagtgat atggcatcgt tgcattttac agtatcttca aaaatacaga atttatagaa 8292
taatttctcc tcatttaata tttttcaaaa tcaaagttat ggtttcctca ttttactaaa 8352
atcgtattct aattcttcat tatagtaaat ctatgagcaa ctccttactt cggttcctct 8412
gatttcaagg ccatatttta aaaaatcaaa aggcactgtg aactattttg aagaaaacac 8472
aacattttaa tacagattga aaggacctct tctgaagcta gaaacaatct atagttatac 8532
atcttcatta atactgtgtt accttttaaa atagtaattt tttacatttt cctgtgtaaa 8592
cctaattgtg gtagaaattt ttaccaactc tatactcaat caagcaaaat ttctgtatat 8652
tccctgtgga atgtacctat gtgagtttca gaaattctca aaatacgtgt tcaaaaattt 8712
ctgcttttgc atctttggga cacctcagaa aacttattaa caactgtgaa tatgagaaat 8772
acagaagaaa ataataagcc ctctatacat aaatgcccag cacaattcat tgttaaaaaa 8832
caaccaaacc tcacactact gtatttcatt atctgtactg aaagcaaatg ctttgtgact 8892
attaaatgtt gcacatcatt cattcactgt ata 8925


<210> 28 

<211> 380 

<212> PRT 

<213> Homo sapien 

<220> 

<221> UNSURE 
<222> (21)...(54)
<223> Xaa = unknown


<400> 28 


Ser Leu Ser Ser Thr Gly Ser Leu Ile Leu Ser Gly Ile Cys Ala Ile
1 5 10 15

Lys Leu Phe Pro Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
20 25 30

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
35 40 45

Xaa Xaa Xaa Xaa Xaa Xaa Ile Phe Pro Phe Gln Cys Phe Gly Leu Leu
50 55 60

Gly Val Asn Gly Ala Gly Lys Ser Ser Thr Phe Lys Met Leu Thr Gly
65 70 75 80

Asp Thr Thr Val Thr Arg Gly Asp Ala Phe Leu Asn Ile Cys Ser Ile
85 90 95

Leu Ser Asn Ile His Glu Val His Gln Asn Met Gly Tyr Cys Pro Gln
100 105 110

Phe Asp Ala Ile Thr Glu Leu Leu Thr Gly Arg Glu His Val Glu Phe
115 120 125

Phe Ala Leu Leu Arg Gly Val Pro Glu Lys Glu Val Gly Lys Val Gly
130 135 140

Glu Trp Ala Ile Arg Lys Leu Gly Leu Val Lys Tyr Gly Glu Lys Tyr
145 150 155 160

Ala Gly Asn Tyr Ser Gly Gly Asn Lys Arg Lys Leu Ser Thr Ala Met
165 170 175

Ala Leu Ile Gly Gly Pro Pro Val Val Phe Leu Asp Glu Pro Thr Thr
180 185 190

Gly Met Asp Pro Lys Ala Arg Arg Phe Leu Trp Asn Cys Ala Leu Ser
195 200 205

Val Val Lys Glu Gly Arg Ser Val Val Leu Thr Ser His Ser Met Glu
210 215 220

Glu Cys Glu Ala Leu Cys Thr Arg Met Ala Ile Met Val Asn Gly Arg
225 230 235 240

Phe Arg Cys Leu Gly Ser Val Gln His Leu Lys Asn Arg Phe Gly Asp
245 250 255

Gly Tyr Thr Ile Val Val Arg Ile Ala Gly Ser Asn Pro Asp Leu Lys
260 265 270

Pro Val Gln Asp Phe Phe Gly Leu Ala Phe Pro Gly Ser Val Leu Lys
275 280 285

Glu Lys His Arg Asn Met Leu Gln Tyr Gln Leu Pro Ser Ser Leu Ser
290 295 300

Ser Leu Ala Arg Ile Phe Ser Ile Leu Ser Gln Ser Lys Lys Arg Leu
305 310 315 320

His Ile Glu Asp Tyr Ser Val Ser Gln Thr Thr Leu Asp Gln Val Phe
325 330 335

Val Asn Phe Ala Lys Asp Gln Ser Asp Asp Asp His Leu Lys Asp Leu
340 345 350

Ser Leu His Lys Asn Gln Thr Val Val Asp Val Ala Val Leu Thr Ser
355 360 365

Phe Leu Gln Asp Glu Lys Val Lys Glu Ser Tyr Val
370 375 380


<210> 29 

<211> 897 

<212> DNA 

<213> Homo sapien 


<220> 

<221> CDS 

<222> (39)...(842)

<223> Nucleotide sequence encoding apolipoprotein A-1 
(APOA1)


<400> 29 

agagactgcg agaaggaggt cccccacggc ccttcagg atg aaa gct gcg gtg ctg 56
Met Lys Ala Ala Val Leu
1 5

acc ttg gcc gtg ctc ttc ctg acg ggg agc cag gct cgg cat ttc tgg 104
Thr Leu Ala Val Leu Phe Leu Thr Gly Ser Gln Ala Arg His Phe Trp
10 15 20

cag caa gat gaa ccc ccc cag agc ccc tgg gat cga gtg aag gac ctg 152
Gln Gln Asp Glu Pro Pro Gln Ser Pro Trp Asp Arg Val Lys Asp Leu
25 30 35

gcc act gtg tac gtg gat gtg ctc aaa gac agc ggc aga gac tat gtg 200
Ala Thr Val Tyr Val Asp Val Leu Lys Asp Ser Gly Arg Asp Tyr Val
40 45 50

tcc cag ttt gaa ggc tcc gcc ttg gga aaa cag cta aac cta aag ctc 248
Ser Gln Phe Glu Gly Ser Ala Leu Gly Lys Gln Leu Asn Leu Lys Leu
55 60 65 70

ctt gac aac tgg gac agc gtg acc tcc acc ttc agc aag ctg cgc gaa 296
Leu Asp Asn Trp Asp Ser Val Thr Ser Thr Phe Ser Lys Leu Arg Glu
75 80 85

cag ctc ggc cct gtg acc cag gag ttc tgg gat aac ctg gaa aag gag 344
Gln Leu Gly Pro Val Thr Gln Glu Phe Trp Asp Asn Leu Glu Lys Glu
90 95 100

aca gag ggc ctg agg cag gag atg agc aag gat ctg gag gag gtg aag 392
Thr Glu Gly Leu Arg Gln Glu Met Ser Lys Asp Leu Glu Glu Val Lys
105 110 115

gcc aag gtg cag ccc tac ctg gac gac ttc cag aag aag tgg cag gag 440
Ala Lys Val Gln Pro Tyr Leu Asp Asp Phe Gln Lys Lys Trp Gln Glu
120 125 130

gag atg gag ctc tac cgc cag aag gtg gag ccg ctg cgc gca gag ctc 488
Glu Met Glu Leu Tyr Arg Gln Lys Val Glu Pro Leu Arg Ala Glu Leu
135 140 145 150

caa gag ggc gcg cgc cag aag ctg cac gag ctg caa gag aag ctg agc 536
Gln Glu Gly Ala Arg Gln Lys Leu His Glu Leu Gln Glu Lys Leu Ser
155 160 165

cca ctg ggc gag gag atg cgc gac cgc gcg cgc gcc cat gtg gac gcg 584
Pro Leu Gly Glu Glu Met Arg Asp Arg Ala Arg Ala His Val Asp Ala
170 175 180

ctg cgc acg cat ctg gcc ccc tac agc gac gag ctg cgc cag cgc ttg 632
Leu Arg Thr His Leu Ala Pro Tyr Ser Asp Glu Leu Arg Gln Arg Leu
185 190 195

gcc gcg cgc ctt gag gct ctc aag gag aac ggc ggc gcc aga ctg gcc 680
Ala Ala Arg Leu Glu Ala Leu Lys Glu Asn Gly Gly Ala Arg Leu Ala
200 205 210

gag tac cac gcc aag gcc acc gag cat ctg agc acg ctc agc gag aag 728
Glu Tyr His Ala Lys Ala Thr Glu His Leu Ser Thr Leu Ser Glu Lys
215 220 225 230

gcc aag ccc gcg ctc gag gac ctc cgc caa ggc ctg ctg ccc gtg ctg 776
Ala Lys Pro Ala Leu Glu Asp Leu Arg Gln Gly Leu Leu Pro Val Leu
235 240 245

gag agc ttc aag gtc agc ttc ctg agc gct ctc gag gag tac act aag 824
Glu Ser Phe Lys Val Ser Phe Leu Ser Ala Leu Glu Glu Tyr Thr Lys
250 255 260

aag ctc aac acc cag tga ggcgcccgcc gccgcccccc ttcccggtgc 872
Lys Leu Asn Thr Gln *
265

tcagaataaa cgtttccaaa gtggg 897

<210> 30 

<211> 267 

<212> PRT 

<213> Homo sapien 

<400> 30 

Met Lys Ala Ala Val Leu Thr Leu Ala Val Leu Phe Leu Thr Gly Ser
1 5 10 15

Gln Ala Arg His Phe Trp Gln Gln Asp Glu Pro Pro Gln Ser Pro Trp
20 25 30

Asp Arg Val Lys Asp Leu Ala Thr Val Tyr Val Asp Val Leu Lys Asp
35 40 45

Ser Gly Arg Asp Tyr Val Ser Gln Phe Glu Gly Ser Ala Leu Gly Lys
50 55 60

Gln Leu Asn Leu Lys Leu Leu Asp Asn Trp Asp Ser Val Thr Ser Thr
65 70 75 80

Phe Ser Lys Leu Arg Glu Gln Leu Gly Pro Val Thr Gln Glu Phe Trp
85 90 95

Asp Asn Leu Glu Lys Glu Thr Glu Gly Leu Arg Gln Glu Met Ser Lys
100 105 110

Asp Leu Glu Glu Val Lys Ala Lys Val Gln Pro Tyr Leu Asp Asp Phe
115 120 125

Gln Lys Lys Trp Gln Glu Glu Met Glu Leu Tyr Arg Gln Lys Val Glu
130 135 140

Pro Leu Arg Ala Glu Leu Gln Glu Gly Ala Arg Gln Lys Leu His Glu
145 150 155 160

Leu Gln Glu Lys Leu Ser Pro Leu Gly Glu Glu Met Arg Asp Arg Ala
165 170 175

Arg Ala His Val Asp Ala Leu Arg Thr His Leu Ala Pro Tyr Ser Asp
180 185 190

Glu Leu Arg Gln Arg Leu Ala Ala Arg Leu Glu Ala Leu Lys Glu Asn
195 200 205

Gly Gly Ala Arg Leu Ala Glu Tyr His Ala Lys Ala Thr Glu His Leu
210 215 220

Ser Thr Leu Ser Glu Lys Ala Lys Pro Ala Leu Glu Asp Leu Arg Gln
225 230 235 240

Gly Leu Leu Pro Val Leu Glu Ser Phe Lys Val Ser Phe Leu Ser Ala
245 250 255

Leu Glu Glu Tyr Thr Lys Lys Leu Asn Thr Gln
260 265

<210> 31 

<211> 14121 

<212> DNA 

<213> Homo sapien 


<220> 
<221> CDS 

<222> (129)...(13820)

<223> Nucleotide sequence encoding apolipoprotein B 
(APOB) 

<400> 31 

attcccaccg ggacctgcgg ggctgagtgc ccttctcggt tgctgccgct gaggagcccg 60 
cccagccagc cagggccgcg aggccgaggc caggccgcag cccaggagcc gccccaccgc 120 
agctggcg atg gac ccg ccg agg ccc gcg ctg ctg gcg ctg ctg gcg ctg 170 
Met Asp Pro Pro Arg Pro Ala Leu Leu Ala Leu Leu Ala Leu 
1 5 10 

cct gcg ctg ctg ctg ctg ctg ctg gcg ggc gcc agg gcc gaa gag gaa 218 
Pro Ala Leu Leu Leu Leu Leu Leu Ala Gly Ala Arg Ala Glu Glu Glu 
15 20 25 30 

atg ctg gaa aat gtc agc ctg gtc tgt cca aaa gat gcg acc cga ttc 266 
Met Leu Glu Asn Val Ser Leu Val Cys Pro Lys Asp Ala Thr Arg Phe 
35 40 45 

aag cac ctc cgg aag tac aca tac aac tat gag gct gag agt tcc agt 314 
Lys His Leu Arg Lys Tyr Thr Tyr Asn Tyr Glu Ala Glu Ser Ser Ser 
50 55 60 

gga gtc cct ggg act gct gat tca aga agt gcc acc agg atc aac tgc 362 
Gly Val Pro Gly Thr Ala Asp Ser Arg Ser Ala Thr Arg Ile Asn Cys 
65 70 75 

aag gtt gag ctg gag gtt ccc cag ctc tgc agc ttc atc ctg aag acc 410 
Lys Val Glu Leu Glu Val Pro Gln Leu Cys Ser Phe Ile Leu Lys Thr 
80 85 90 

agc cag tgc acc ctg aaa gag gtg tat ggc ttc aac cct gag ggc aaa 458 
Ser Gln Cys Thr Leu Lys Glu Val Tyr Gly Phe Asn Pro Glu Gly Lys 
95 100 105 110 

gcc ttg ctg aag aaa acc aag aac tct gag gag ttt gct gca gcc atg 506 
Ala Leu Leu Lys Lys Thr Lys Asn Ser Glu Glu Phe Ala Ala Ala Met 
115 120 125 

tcc agg tat gag ctc aag ctg gcc att cca gaa ggg aag cag gtt ttc 554 
Ser Arg Tyr Glu Leu Lys Leu Ala Ile Pro Glu Gly Lys Gln Val Phe 
130 135 140 

ctt tac ccg gag aaa gat gaa cct act tac atc ctg aac atc aag agg 602 
Leu Tyr Pro Glu Lys Asp Glu Pro Thr Tyr Ile Leu Asn Ile Lys Arg 
145 150 155 

ggc atc att tct gcc ctc ctg gtt ccc cca gag aca gaa gaa gcc aag 650 
Gly Ile Ile Ser Ala Leu Leu Val Pro Pro Glu Thr Glu Glu Ala Lys 
160 165 170 

caa gtg ttg ttt ctg gat acc gtg tat gga aac tgc tcc act cac ttt 698 
Gln Val Leu Phe Leu Asp Thr Val Tyr Gly Asn Cys Ser Thr His Phe 
175 180 185 190 

acc gtc aag acg agg aag ggc aat gtg gca aca gaa ata tcc act gaa 746 
Thr Val Lys Thr Arg Lys Gly Asn Val Ala Thr Glu Ile Ser Thr Glu 
195 200 205 

aga gac ctg ggg cag tgt gat cgc ttc aag ccc atc cgc aca ggc atc 794 
Arg Asp Leu Gly Gln Cys Asp Arg Phe Lys Pro Ile Arg Thr Gly Ile 
210 215 220 

agc cca ctt gct ctc atc aaa ggc atg acc cgc ccc ttg tca act ctg 842 
Ser Pro Leu Ala Leu Ile Lys Gly Met Thr Arg Pro Leu Ser Thr Leu 
225 230 235 

atc agc agc agc cag tcc tgt cag tac aca ctg gac gct aag agg aag 890 
Ile Ser Ser Ser Gln Ser Cys Gln Tyr Thr Leu Asp Ala Lys Arg Lys 
240 245 250 

cat gtg gca gaa gcc atc tgc aag gag caa cac ctc ttc ctg cct ttc 938 
His Val Ala Glu Ala Ile Cys Lys Glu Gln His Leu Phe Leu Pro Phe 
255 260 265 270 

tcc tac aac aat aag tat ggg atg gta gca caa gtg aca cag act ttg 986 
Ser Tyr Asn Asn Lys Tyr Gly Met Val Ala Gln Val Thr Gln Thr Leu 
275 280 285 

aaa ctt gaa gac aca cca aag atc aac agc cgc ttc ttt ggt gaa ggt 1034 
Lys Leu Glu Asp Thr Pro Lys Ile Asn Ser Arg Phe Phe Gly Glu Gly 
290 295 300 

act aag aag atg ggc ctc gca ttt gag agc acc aaa tcc aca tca cct 1082 
Thr Lys Lys Met Gly Leu Ala Phe Glu Ser Thr Lys Ser Thr Ser Pro 
305 310 315 




cca aag cag gcc gaa gct gtt ttg aag act ctc cag gaa ctg aaa aaa 1130 
Pro Lys Gln Ala Glu Ala Val Leu Lys Thr Leu Gln Glu Leu Lys Lys 
320 325 330 

cta acc atc tct gag caa aat atc cag aga gct aat ctc ttc aat aag 1178 
Leu Thr Ile Ser Glu Gln Asn Ile Gln Arg Ala Asn Leu Phe Asn Lys 
335 340 345 350 

ctg gtt act gag ctg aga ggc ctc agt gat gaa gca gtc aca tct ctc 1226 
Leu Val Thr Glu Leu Arg Gly Leu Ser Asp Glu Ala Val Thr Ser Leu 
355 360 365 

ttg cca cag ctg att gag gtg tcc agc ccc atc act tta caa gcc ttg 1274 
Leu Pro Gln Leu Ile Glu Val Ser Ser Pro Ile Thr Leu Gln Ala Leu 
370 375 380 

gtt cag tgt gga cag cct cag tgc tcc act cac atc ctc cag tgg ctg 1322 
Val Gln Cys Gly Gln Pro Gln Cys Ser Thr His Ile Leu Gln Trp Leu 
385 390 395 

aaa cgt gtg cat gcc aac ccc ctt ctg ata gat gtg gtc acc tac ctg 1370 
Lys Arg Val His Ala Asn Pro Leu Leu Ile Asp Val Val Thr Tyr Leu 
400 405 410 

gtg gcc ctg atc ccc gag ccc tca gca cag cag ctg cga gag atc ttc 1418 
Val Ala Leu Ile Pro Glu Pro Ser Ala Gln Gln Leu Arg Glu Ile Phe 
415 420 425 430 

aac atg gcg agg gat cag cgc agc cga gcc acc ttg tat gcg ctg agc 1466 
Asn Met Ala Arg Asp Gln Arg Ser Arg Ala Thr Leu Tyr Ala Leu Ser 
435 440 445 

cac gcg gtc aac aac tat cat aag aca aac cct aca ggg acc cag gag 1514 
His Ala Val Asn Asn Tyr His Lys Thr Asn Pro Thr Gly Thr Gln Glu 
450 455 460 

ctg ctg gac att gct aat tac ctg atg gaa cag att caa gat gac tgc 1562 
Leu Leu Asp Ile Ala Asn Tyr Leu Met Glu Gln Ile Gln Asp Asp Cys 
465 470 475 

act ggg gat gaa gat tac acc tat ttg att ctg cgg gtc att gga aat 1610 
Thr Gly Asp Glu Asp Tyr Thr Tyr Leu Ile Leu Arg Val Ile Gly Asn 
480 485 490 

atg ggc caa acc atg gag cag tta act cca gaa ctc aag tct tca atc 1658 
Met Gly Gln Thr Met Glu Gln Leu Thr Pro Glu Leu Lys Ser Ser Ile 
495 500 505 510 

ctc aaa tgt gtc caa agt aca aag cca tca ctg atg atc cag aaa gct 1706 
Leu Lys Cys Val Gln Ser Thr Lys Pro Ser Leu Met Ile Gln Lys Ala 
515 520 525 

gcc atc cag gct ctg cgg aaa atg gag cct aaa gac aag gac cag gag 1754 
Ala Ile Gln Ala Leu Arg Lys Met Glu Pro Lys Asp Lys Asp Gln Glu 
530 535 540 

gtt ctt ctt cag act ttc ctt gat gat gct tct ccg gga gat aag cga 1802 
Val Leu Leu Gln Thr Phe Leu Asp Asp Ala Ser Pro Gly Asp Lys Arg 
545 550 555 

ctg gct gcc tat ctt atg ttg atg agg agt cct tca cag gca gat att 1850 
Leu Ala Ala Tyr Leu Met Leu Met Arg Ser Pro Ser Gln Ala Asp Ile 
560 565 570 

aac aaa att gtc caa att cta cca tgg gaa cag aat gag caa gtg aag 1898 
Asn Lys Ile Val Gln Ile Leu Pro Trp Glu Gln Asn Glu Gln Val Lys 
575 580 585 590 

aac ttt gtg gct tcc cat att gcc aat atc ttg aac tca gaa gaa ttg 1946 
Asn Phe Val Ala Ser His Ile Ala Asn Ile Leu Asn Ser Glu Glu Leu 
595 600 605 

gat atc caa gat ctg aaa aag tta gtg aaa gaa gct ctg aaa gaa tct 1994 
Asp Ile Gln Asp Leu Lys Lys Leu Val Lys Glu Ala Leu Lys Glu Ser 
610 615 620 

caa ctt cca act gtc atg gac ttc aga aaa ttc tct cgg aac tat caa 2042 
Gln Leu Pro Thr Val Met Asp Phe Arg Lys Phe Ser Arg Asn Tyr Gln 
625 630 635 

ctc tac aaa tct gtt tct ctt cca tca ctt gac cca gcc tca gcc aaa 2090 
Leu Tyr Lys Ser Val Ser Leu Pro Ser Leu Asp Pro Ala Ser Ala Lys 
640 645 650 

ata gaa ggg aat ctt ata ttt gat cca aat aac tac ctt cct aaa gaa 2138 
Ile Glu Gly Asn Leu Ile Phe Asp Pro Asn Asn Tyr Leu Pro Lys Glu 
655 660 665 670 

agc atg ctg aaa act acc ctc act gcc ttt gga ttt gct tca gct gac 2186 
Ser Met Leu Lys Thr Thr Leu Thr Ala Phe Gly Phe Ala Ser Ala Asp 
675 680 685 

ctc atc gag att ggc ttg gaa gga aaa ggc ttt gag cca aca ttg gaa 2234 
Leu Ile Glu Ile Gly Leu Glu Gly Lys Gly Phe Glu Pro Thr Leu Glu 
690 695 700 

gct ctt ttt ggg aag caa gga ttt ttc cca gac agt gtc aac aaa gct 2282 
Ala Leu Phe Gly Lys Gln Gly Phe Phe Pro Asp Ser Val Asn Lys Ala 
705 710 715 

ttg tac tgg gtt aat ggt caa gtt cct gat ggt gtc tct aag gtc tta 2330 
Leu Tyr Trp Val Asn Gly Gln Val Pro Asp Gly Val Ser Lys Val Leu 
720 725 730 

gtg gac cac ttt ggc tat acc aaa gat gat aaa cat gag cag gat atg 2378 
Val Asp His Phe Gly Tyr Thr Lys Asp Asp Lys His Glu Gln Asp Met 
735 740 745 750 

gta aat gga ata atg ctc agt gtt gag aag ctg att aaa gat ttg aaa 2426 
Val Asn Gly Ile Met Leu Ser Val Glu Lys Leu Ile Lys Asp Leu Lys 
755 760 765 

tcc aaa gaa gtc ccg gaa gcc aga gcc tac ctc cgc atc ttg gga gag 2474 
Ser Lys Glu Val Pro Glu Ala Arg Ala Tyr Leu Arg Ile Leu Gly Glu 
770 775 780 

gag ctt ggt ttt gcc agt ctc cat gac ctc cag ctc ctg gga aag ctg 2522 
Glu Leu Gly Phe Ala Ser Leu His Asp Leu Gln Leu Leu Gly Lys Leu 
785 790 795 

ctt ctg atg ggt gcc cgc act ctg cag ggg atc ccc cag atg att gga 2570 
Leu Leu Met Gly Ala Arg Thr Leu Gln Gly Ile Pro Gln Met Ile Gly 
800 805 810 

gag gtc atc agg aag ggc tca aag aat gac ttt ttt ctt cac tac atc 2618 
Glu Val Ile Arg Lys Gly Ser Lys Asn Asp Phe Phe Leu His Tyr Ile 
815 820 825 830 

ttc atg gag aat gcc ttt gaa ctc ccc act gga gct gga tta cag ttg 2666 
Phe Met Glu Asn Ala Phe Glu Leu Pro Thr Gly Ala Gly Leu Gln Leu 
835 840 845 

caa ata tct tca tct gga gtc att gct ccc gga gcc aag gct gga gta 2714 
Gln Ile Ser Ser Ser Gly Val Ile Ala Pro Gly Ala Lys Ala Gly Val 
850 855 860 

aaa ctg gaa gta gcc aac atg cag gct gaa ctg gtg gca aaa ccc tcc 2762 
Lys Leu Glu Val Ala Asn Met Gln Ala Glu Leu Val Ala Lys Pro Ser 
865 870 875 

gtg tct gtg gag ttt gtg aca aat atg ggc atc atc att ccg gac ttc 2810 
Val Ser Val Glu Phe Val Thr Asn Met Gly Ile Ile Ile Pro Asp Phe 
880 885 890 

gct agg agt ggg gtc cag atg aac acc aac ttc ttc cac gag tcg ggt 2858 
Ala Arg Ser Gly Val Gln Met Asn Thr Asn Phe Phe His Glu Ser Gly 
895 900 905 910 

ctg gag gct cat gtt gcc cta aaa gct ggg aag ctg aag ttt atc att 2906 
Leu Glu Ala His Val Ala Leu Lys Ala Gly Lys Leu Lys Phe Ile Ile 
915 920 925 

cct tcc cca aag aga cca gtc aag ctg ctc agt gga ggc aac aca tta 2954 
Pro Ser Pro Lys Arg Pro Val Lys Leu Leu Ser Gly Gly Asn Thr Leu 
930 935 940 

cat ttg gtc tct acc acc aaa acg gag gtg atc cca cct ctc att gag 3002 
His Leu Val Ser Thr Thr Lys Thr Glu Val Ile Pro Pro Leu Ile Glu 
945 950 955 

aac agg cag tcc tgg tca gtt tgc aag caa gtc ttt cct ggc ctg aat 3050 
Asn Arg Gln Ser Trp Ser Val Cys Lys Gln Val Phe Pro Gly Leu Asn 
960 965 970 

tac tgc acc tca ggc gct tac tcc aac gcc agc tcc aca gac tcc gcc 3098 
Tyr Cys Thr Ser Gly Ala Tyr Ser Asn Ala Ser Ser Thr Asp Ser Ala 
975 980 985 990 

tcc tac tat ccg ctg acc ggg gac acc aga tta gag ctg gaa ctg agg 3146 
Ser Tyr Tyr Pro Leu Thr Gly Asp Thr Arg Leu Glu Leu Glu Leu Arg 
995 1000 1005 

cct aca gga gag att gag cag tat tct gtc agc gca acc tat gag ctc 3194 
Pro Thr Gly Glu Ile Glu Gln Tyr Ser Val Ser Ala Thr Tyr Glu Leu 
1010 1015 1020 

cag aga gag gac aga gcc ttg gtg gat acc ctg aag ttt gta act caa 3242 
Gln Arg Glu Asp Arg Ala Leu Val Asp Thr Leu Lys Phe Val Thr Gln 
1025 1030 1035 

gca gaa ggt gcg aag cag act gag gct acc atg aca ttc aaa tat aat 3290 
Ala Glu Gly Ala Lys Gln Thr Glu Ala Thr Met Thr Phe Lys Tyr Asn 
1040 1045 1050 

cgg cag agt atg acc ttg tcc agt gaa gtc caa att ccg gat ttt gat 3338 
Arg Gln Ser Met Thr Leu Ser Ser Glu Val Gln Ile Pro Asp Phe Asp 
1055 1060 1065 1070 

gtt gac ctc gga aca atc ctc aga gtt aat gat gaa tct act gag ggc 3386 
Val Asp Leu Gly Thr Ile Leu Arg Val Asn Asp Glu Ser Thr Glu Gly 
1075 1080 1085 

aaa acg tct tac aga ctc acc ctg gac att cag aac aag aaa att act 3434 
Lys Thr Ser Tyr Arg Leu Thr Leu Asp Ile Gln Asn Lys Lys Ile Thr 
1090 1095 1100 

gag gtc gcc ctc atg ggc cac cta agt tgt gac aca aag gaa gaa aga 3482 
Glu Val Ala Leu Met Gly His Leu Ser Cys Asp Thr Lys Glu Glu Arg 
1105 1110 1115 

aaa atc aag ggt gtt att tcc ata ccc cgt ttg caa gca gaa gcc aga 3530 
Lys Ile Lys Gly Val Ile Ser Ile Pro Arg Leu Gln Ala Glu Ala Arg 
1120 1125 1130 

agt gag atc ctc gcc cac tgg tcg cct gcc aaa ctg ctt ctc caa atg 3578 
Ser Glu Ile Leu Ala His Trp Ser Pro Ala Lys Leu Leu Leu Gln Met 
1135 1140 1145 1150 

gac tca tct gct aca gct tat ggc tcc aca gtt tcc aag agg gtg gca 3626 
Asp Ser Ser Ala Thr Ala Tyr Gly Ser Thr Val Ser Lys Arg Val Ala 
1155 1160 1165 

tgg cat tat gat gaa gag aag att gaa ttt gaa tgg aac aca ggc acc 3674 
Trp His Tyr Asp Glu Glu Lys Ile Glu Phe Glu Trp Asn Thr Gly Thr 
1170 1175 1180 

aat gta gat acc aaa aaa atg act tcc aat ttc cct gtg gat ctc tcc 3722 
Asn Val Asp Thr Lys Lys Met Thr Ser Asn Phe Pro Val Asp Leu Ser 
1185 1190 1195 

gat tat cct aag agc ttg cat atg tat gct aat aga ctc ctg gat cac 3770 
Asp Tyr Pro Lys Ser Leu His Met Tyr Ala Asn Arg Leu Leu Asp His 
1200 1205 1210 

aga gtc cct gaa aca gac atg act ttc cgg cac gtg ggt tcc aaa tta 3818 
Arg Val Pro Glu Thr Asp Met Thr Phe Arg His Val Gly Ser Lys Leu 
1215 1220 1225 1230 

ata gtt gca atg agc tca tgg ctt cag aag gca tct ggg agt ctt cct 3866 
Ile Val Ala Met Ser Ser Trp Leu Gln Lys Ala Ser Gly Ser Leu Pro 
1235 1240 1245 
tat acc cag act ttg caa gac cac ctc aat agc ctg aag gag ttc aac 3914 
Tyr Thr Gln Thr Leu Gln Asp His Leu Asn Ser Leu Lys Glu Phe Asn 
1250 1255 1260 

ctc cag aac atg gga ttg cca gac ttc cac atc cca gaa aac ctc ttc 3962 
Leu Gln Asn Met Gly Leu Pro Asp Phe His Ile Pro Glu Asn Leu Phe 
1265 1270 1275 

tta aaa agc gat ggc cgg gtc aaa tat acc ttg aac aag aac agt ttg 4010 
Leu Lys Ser Asp Gly Arg Val Lys Tyr Thr Leu Asn Lys Asn Ser Leu 
1280 1285 1290 

aaa att gag att cct ttg cct ttt ggt ggc aaa tcc tcc aga gat cta 4058 
Lys Ile Glu Ile Pro Leu Pro Phe Gly Gly Lys Ser Ser Arg Asp Leu 
1295 1300 1305 1310 

aag atg tta gag act gtt agg aca cca gcc ctc cac ttc aag tct gtg 4106 
Lys Met Leu Glu Thr Val Arg Thr Pro Ala Leu His Phe Lys Ser Val 
1315 1320 1325 

gga ttc cat ctg cca tct cga gag ttc caa gtc cct act ttt acc att 4154 
Gly Phe His Leu Pro Ser Arg Glu Phe Gln Val Pro Thr Phe Thr Ile 
1330 1335 1340 

ccc aag ttg tat caa ctg caa gtg cct ctc ctg ggt gtt cta gac ctc 4202 
Pro Lys Leu Tyr Gln Leu Gln Val Pro Leu Leu Gly Val Leu Asp Leu 
1345 1350 1355 

tcc acg aat gtc tac agc aac ttg tac aac tgg tcc gcc tcc tac agt 4250 
Ser Thr Asn Val Tyr Ser Asn Leu Tyr Asn Trp Ser Ala Ser Tyr Ser 
1360 1365 1370 

ggt ggc aac acc agc aca gac cat ttc agc ctt cgg gct cgt tac cac 4298 
Gly Gly Asn Thr Ser Thr Asp His Phe Ser Leu Arg Ala Arg Tyr His 
1375 1380 1385 1390 

atg aag gct gac tct gtg gtt gac ctg ctt tcc tac aat gtg caa gga 4346 
Met Lys Ala Asp Ser Val Val Asp Leu Leu Ser Tyr Asn Val Gln Gly 
1395 1400 1405 

tct gga gaa aca aca tat gac cac aag aat acg ttc aca cta tca tgt 4394 
Ser Gly Glu Thr Thr Tyr Asp His Lys Asn Thr Phe Thr Leu Ser Cys 
1410 1415 1420 

gat ggg tct cta cgc cac aaa ttt cta gat tcg aat atc aaa ttc agt 4442 
Asp Gly Ser Leu Arg His Lys Phe Leu Asp Ser Asn Ile Lys Phe Ser 
1425 1430 1435 

cat gta gaa aaa ctt gga aac aac cca gtc tca aaa ggt tta cta ata 4490 
His Val Glu Lys Leu Gly Asn Asn Pro Val Ser Lys Gly Leu Leu Ile 
1440 1445 1450 

ttc gat gca tct agt tcc tgg gga cca cag atg tct gct tca gtt cat 4538 
Phe Asp Ala Ser Ser Ser Trp Gly Pro Gln Met Ser Ala Ser Val His 
1455 1460 1465 1470 

ttg gac tcc aaa aag aaa cag cat ttg ttt gtc aaa gaa gtc aag att 4586 
Leu Asp Ser Lys Lys Lys Gln His Leu Phe Val Lys Glu Val Lys Ile 
1475 1480 1485 

gat ggg cag ttc aga gtc tct tcg ttc tat gct aaa ggc aca tat ggc 4634 
Asp Gly Gln Phe Arg Val Ser Ser Phe Tyr Ala Lys Gly Thr Tyr Gly 
1490 1495 1500 

ctg tct tgt cag agg gat cct aac act ggc cgg ctc aat gga gag tcc 4682 
Leu Ser Cys Gln Arg Asp Pro Asn Thr Gly Arg Leu Asn Gly Glu Ser 
1505 1510 1515 

aac ctg agg ttt aac tcc tcc tac ctc caa ggc acc aac cag ata aca 4730 
Asn Leu Arg Phe Asn Ser Ser Tyr Leu Gln Gly Thr Asn Gln Ile Thr 
1520 1525 1530 

gga aga tat gaa gat gga acc ctc tcc ctc acc tcc acc tct gat ctg 4778 
Gly Arg Tyr Glu Asp Gly Thr Leu Ser Leu Thr Ser Thr Ser Asp Leu 
1535 1540 1545 1550 

caa agt ggc atc att aaa aat act gct tcc cta aag tat gag aac tac 4826 
Gln Ser Gly Ile Ile Lys Asn Thr Ala Ser Leu Lys Tyr Glu Asn Tyr 
1555 1560 1565 

gag ctg act tta aaa tct gac acc aat ggg aag tat aag aac ttt gcc 4874 
Glu Leu Thr Leu Lys Ser Asp Thr Asn Gly Lys Tyr Lys Asn Phe Ala 
1570 1575 1580 

act tct aac aag atg gat atg acc ttc tct aag caa aat gca ctg ctg 4922 
Thr Ser Asn Lys Met Asp Met Thr Phe Ser Lys Gln Asn Ala Leu Leu 
1585 1590 1595 

cgt tct gaa tat cag gct gat tac gag tca ttg agg ttc ttc agc ctg 4970 
Arg Ser Glu Tyr Gln Ala Asp Tyr Glu Ser Leu Arg Phe Phe Ser Leu 
1600 1605 1610 

ctt tct gga tca cta aat tcc cat ggt ctt gag tta aat gct gac atc 5018 
Leu Ser Gly Ser Leu Asn Ser His Gly Leu Glu Leu Asn Ala Asp Ile 
1615 1620 1625 1630 

tta ggc act gac aaa att aat agt ggt gct cac aag gcg aca cta agg 5066 
Leu Gly Thr Asp Lys Ile Asn Ser Gly Ala His Lys Ala Thr Leu Arg 
1635 1640 1645 

att ggc caa gat gga ata tct acc agt gca acg acc aac ttg aag tgt 5114 
Ile Gly Gln Asp Gly Ile Ser Thr Ser Ala Thr Thr Asn Leu Lys Cys 
1650 1655 1660 

agt ctc ctg gtg ctg gag aat gag ctg aat gca gag ctt ggc ctc tct 5162 
Ser Leu Leu Val Leu Glu Asn Glu Leu Asn Ala Glu Leu Gly Leu Ser 
1665 1670 1675 

ggg gca tct atg aaa tta aca aca aat ggc cgc ttc agg gaa cac aat 5210 
Gly Ala Ser Met Lys Leu Thr Thr Asn Gly Arg Phe Arg Glu His Asn 
1680 1685 1690 

gca aaa ttc agt ctg gat ggg aaa gcc gcc ctc aca gag cta tca ctg 5258 
Ala Lys Phe Ser Leu Asp Gly Lys Ala Ala Leu Thr Glu Leu Ser Leu 
1695 1700 1705 1710 
gga agt gct tat cag gcc atg att ctg ggt gtc gac agc aaa aac att 5306 
Gly Ser Ala Tyr Gln Ala Met Ile Leu Gly Val Asp Ser Lys Asn Ile 
1715 1720 1725 

ttc aac ttc aag gtc agt caa gaa gga ctt aag ctc tca aat gac atg 5354 
Phe Asn Phe Lys Val Ser Gln Glu Gly Leu Lys Leu Ser Asn Asp Met 
1730 1735 1740 

atg ggc tca tat gct gaa atg aaa ttt gac cac aca aac agt ctg aac 5402 
Met Gly Ser Tyr Ala Glu Met Lys Phe Asp His Thr Asn Ser Leu Asn 
1745 1750 1755 

att gca ggc tta tca ctg gac ttc tct tca aaa ctt gac aac att tac 5450 
Ile Ala Gly Leu Ser Leu Asp Phe Ser Ser Lys Leu Asp Asn Ile Tyr 
1760 1765 1770 

agc tct gac aag ttt tat aag caa act gtt aat tta cag cta cag ccc 5498 
Ser Ser Asp Lys Phe Tyr Lys Gln Thr Val Asn Leu Gln Leu Gln Pro 
1775 1780 1785 1790 

tat tct ctg gta act act tta aac agt gac ctg aaa tac aat gct ctg 5546 
Tyr Ser Leu Val Thr Thr Leu Asn Ser Asp Leu Lys Tyr Asn Ala Leu 
1795 1800 1805 

gat ctc acc aac aat ggg aaa cta cgg cta gaa ccc ctg aag ctg cat 5594 
Asp Leu Thr Asn Asn Gly Lys Leu Arg Leu Glu Pro Leu Lys Leu His 
1810 1815 1820 

gtg gct ggt aac cta aaa gga gcc tac caa aat aat gaa ata aaa cac 5642 
Val Ala Gly Asn Leu Lys Gly Ala Tyr Gln Asn Asn Glu Ile Lys His 
1825 1830 1835 

atc tat gcc atc tct tct gct gcc tta tca gca agc tat aaa gca gac 5690 
Ile Tyr Ala Ile Ser Ser Ala Ala Leu Ser Ala Ser Tyr Lys Ala Asp 
1840 1845 1850 

act gtt gct aag gtt cag ggt gtg gag ttt agc cat cgg ctc aac aca 5738 
Thr Val Ala Lys Val Gln Gly Val Glu Phe Ser His Arg Leu Asn Thr 
1855 1860 1865 1870 

gac atc gct ggg ctg gct tca gcc att gac atg agc aca aac tat aat 5786 
Asp Ile Ala Gly Leu Ala Ser Ala Ile Asp Met Ser Thr Asn Tyr Asn 
1875 1880 1885 

tca gac tca ctg cat ttc agc aat gtc ttc cgt tct gta atg gcc ccg 5834 
Ser Asp Ser Leu His Phe Ser Asn Val Phe Arg Ser Val Met Ala Pro 
1890 1895 1900 

ttt acc atg acc atc gat gca cat aca aat ggc aat ggg aaa ctc gct 5882 
Phe Thr Met Thr Ile Asp Ala His Thr Asn Gly Asn Gly Lys Leu Ala 
1905 1910 1915 

ctc tgg gga gaa cat act ggg cag ctg tat agc aaa ttc ctg ttg aaa 5930 
Leu Trp Gly Glu His Thr Gly Gln Leu Tyr Ser Lys Phe Leu Leu Lys 
1920 1925 1930 

gca gaa cct ctg gca ttt act ttc tct cat gat tac aaa ggc tcc aca 5978 
Ala Glu Pro Leu Ala Phe Thr Phe Ser His Asp Tyr Lys Gly Ser Thr 
1935 1940 1945 1950 

agt cat cat ctc gtg tct agg aaa agc atc agt gca gct ctt gaa cac 6026 
Ser His His Leu Val Ser Arg Lys Ser Ile Ser Ala Ala Leu Glu His 
1955 1960 1965 

aaa gtc agt gcc ctg ctt act cca gct gag cag aca ggc acc tgg aaa 6074 
Lys Val Ser Ala Leu Leu Thr Pro Ala Glu Gln Thr Gly Thr Trp Lys 
1970 1975 1980 

ctc aag acc caa ttt aac aac aat gaa tac agc cag gac ttg gat gct 6122 
Leu Lys Thr Gln Phe Asn Asn Asn Glu Tyr Ser Gln Asp Leu Asp Ala 
1985 1990 1995 

tac aac act aaa gat aaa att ggc gtg gag ctt act gga cga act ctg 6170 
Tyr Asn Thr Lys Asp Lys Ile Gly Val Glu Leu Thr Gly Arg Thr Leu 
2000 2005 2010 

gct gac cta act cta cta gac tcc cca att aaa gtg cca ctt tta ctc 6218 
Ala Asp Leu Thr Leu Leu Asp Ser Pro Ile Lys Val Pro Leu Leu Leu 
2015 2020 2025 2030 

agt gag ccc atc aat atc att gat gct tta gag atg aga gat gcc gtt 6266 
Ser Glu Pro Ile Asn Ile Ile Asp Ala Leu Glu Met Arg Asp Ala Val 
2035 2040 2045 

gag aag ccc caa gaa ttt aca att gtt gct ttt gta aag tat gat aaa 6314 
Glu Lys Pro Gln Glu Phe Thr Ile Val Ala Phe Val Lys Tyr Asp Lys 
2050 2055 2060 

aac caa gat gtt cac tcc att aac ctc cca ttt ttt gag acc ttg caa 6362 
Asn Gln Asp Val His Ser Ile Asn Leu Pro Phe Phe Glu Thr Leu Gln 
2065 2070 2075 

gaa tat ttt gag agg aat cga caa acc att ata gtt gta gtg gaa aac 6410 
Glu Tyr Phe Glu Arg Asn Arg Gln Thr Ile Ile Val Val Val Glu Asn 
2080 2085 2090 

gta cag aga aac ctg aag cac atc aat att gat caa ttt gta aga aaa 6458 
Val Gln Arg Asn Leu Lys His Ile Asn Ile Asp Gln Phe Val Arg Lys 
2095 2100 2105 2110 

tac aga gca gcc ctg gga aaa ctc cca cag caa gct aat gat tat ctg 6506 
Tyr Arg Ala Ala Leu Gly Lys Leu Pro Gln Gln Ala Asn Asp Tyr Leu 
2115 2120 2125 

aat tca ttc aat tgg gag aga caa gtt tca cat gcc aag gag aaa ctg 6554 
Asn Ser Phe Asn Trp Glu Arg Gln Val Ser His Ala Lys Glu Lys Leu 
2130 2135 2140 

act gct ctc aca aaa aag tat aga att aca gaa aat gat ata caa att 6602 
Thr Ala Leu Thr Lys Lys Tyr Arg Ile Thr Glu Asn Asp Ile Gln Ile 
2145 2150 2155 

gca tta gat gat gcc aaa atc aac ttt aat gaa aaa cta tct caa ctg 6650 
Ala Leu Asp Asp Ala Lys Ile Asn Phe Asn Glu Lys Leu Ser Gln Leu 
2160 2165 2170 
cag aca tat atg ata caa ttt gat cag tat att aaa gat agt tat gat 6698 
Gln Thr Tyr Met Ile Gln Phe Asp Gln Tyr Ile Lys Asp Ser Tyr Asp 
2175 2180 2185 2190 

tta cat gat ttg aaa ata gct att gct aat att att gat gaa atc att 6746 
Leu His Asp Leu Lys Ile Ala Ile Ala Asn Ile Ile Asp Glu Ile Ile 
2195 2200 2205 

gaa aaa tta aaa agt ctt gat gag cac tat cat atc cgt gta aat tta 6794 
Glu Lys Leu Lys Ser Leu Asp Glu His Tyr His Ile Arg Val Asn Leu 
2210 2215 2220 

gta aaa aca atc cat gat cta cat ttg ttt att gaa aat att gat ttt 6842 
Val Lys Thr Ile His Asp Leu His Leu Phe Ile Glu Asn Ile Asp Phe 
2225 2230 2235 

aac aaa agt gga agt agt act gca tcc tgg att caa aat gtg gat act 6890 
Asn Lys Ser Gly Ser Ser Thr Ala Ser Trp Ile Gln Asn Val Asp Thr 
2240 2245 2250 

aag tac caa atc aga atc cag ata caa gaa aaa ctg cag cag ctt aag 6938 
Lys Tyr Gln Ile Arg Ile Gln Ile Gln Glu Lys Leu Gln Gln Leu Lys 
2255 2260 2265 2270 

aga cac ata cag aat ata gac atc cag cac cta gct gga aag tta aaa 6986 
Arg His Ile Gln Asn Ile Asp Ile Gln His Leu Ala Gly Lys Leu Lys 
2275 2280 2285 

caa cac att gag gct att gat gtt aga gtg ctt tta gat caa ttg gga 7034 
Gln His Ile Glu Ala Ile Asp Val Arg Val Leu Leu Asp Gln Leu Gly 
2290 2295 2300 

act aca att tca ttt gaa aga ata aat gat gtt ctt gag cat gtc aaa 7082 
Thr Thr Ile Ser Phe Glu Arg Ile Asn Asp Val Leu Glu His Val Lys 
2305 2310 2315 

cac ttt gtt ata aat ctt att ggg gat ttt gaa gta gct gag aaa atc 7130 
His Phe Val Ile Asn Leu Ile Gly Asp Phe Glu Val Ala Glu Lys Ile 
2320 2325 2330 

aat gcc ttc aga gcc aaa gtc cat gag tta atc gag agg tat gaa gta 7178 
Asn Ala Phe Arg Ala Lys Val His Glu Leu Ile Glu Arg Tyr Glu Val 
2335 2340 2345 2350 

gac caa caa atc cag gtt tta atg gat aaa tta gta gag ttg acc cac 7226 
Asp Gln Gln Ile Gln Val Leu Met Asp Lys Leu Val Glu Leu Thr His 
2355 2360 2365 

caa tac aag ttg aag gag act att cag aag cta agc aat gtc cta caa 7274 
Gln Tyr Lys Leu Lys Glu Thr Ile Gln Lys Leu Ser Asn Val Leu Gln 
2370 2375 2380 

caa gtt aag ata aaa gat tac ttt gag aaa ttg gtt gga ttt att gat 7322 
Gln Val Lys Ile Lys Asp Tyr Phe Glu Lys Leu Val Gly Phe Ile Asp 
2385 2390 2395 

gat gct gtg aag aag ctt aat gaa tta tct ttt aaa aca ttc att gaa 7370 
Asp Ala Val Lys Lys Leu Asn Glu Leu Ser Phe Lys Thr Phe Ile Glu 
2400 2405 2410 

gat gtt aac aaa ttc ctt gac atg ttg ata aag aaa tta aag tca ttt 7418 
Asp Val Asn Lys Phe Leu Asp Met Leu Ile Lys Lys Leu Lys Ser Phe 
2415 2420 2425 2430 

gat tac cac cag ttt gta gat gaa acc aat gac aaa atc cgt gag gtg 7466 
Asp Tyr His Gln Phe Val Asp Glu Thr Asn Asp Lys Ile Arg Glu Val 
2435 2440 2445 

act cag aga ctc aat ggt gaa att cag gct ctg gaa cta cca caa aaa 7514 
Thr Gln Arg Leu Asn Gly Glu Ile Gln Ala Leu Glu Leu Pro Gln Lys 
2450 2455 2460 

gct gaa gca tta aaa ctg ttt tta gag gaa acc aag gcc aca gtt gca 7562 
Ala Glu Ala Leu Lys Leu Phe Leu Glu Glu Thr Lys Ala Thr Val Ala 
2465 2470 2475 

gtg tat ctg gaa agc cta cag gac acc aaa ata acc tta atc atc aat 7610 
Val Tyr Leu Glu Ser Leu Gln Asp Thr Lys Ile Thr Leu Ile Ile Asn 
2480 2485 2490 

tgg tta cag gag gct tta agt tca gca tct ttg gct cac atg aag gcc 7658 
Trp Leu Gln Glu Ala Leu Ser Ser Ala Ser Leu Ala His Met Lys Ala 
2495 2500 2505 2510 

aaa ttc cga gag act cta gaa gat aca cga gac cga atg tat caa atg 7706 
Lys Phe Arg Glu Thr Leu Glu Asp Thr Arg Asp Arg Met Tyr Gln Met 
2515 2520 2525 

gac att cag cag gaa ctt caa cga tac ctg tct ctg gta ggc cag gtt 7754 
Asp Ile Gln Gln Glu Leu Gln Arg Tyr Leu Ser Leu Val Gly Gln Val 
2530 2535 2540 

tat agc aca ctt gtc acc tac att tct gat tgg tgg act ctt gct gct 7802 
Tyr Ser Thr Leu Val Thr Tyr Ile Ser Asp Trp Trp Thr Leu Ala Ala 
2545 2550 2555 

aag aac ctt act gac ttt gca gag caa tat tct atc caa gat tgg gct 7850 
Lys Asn Leu Thr Asp Phe Ala Glu Gln Tyr Ser Ile Gln Asp Trp Ala 
2560 2565 2570 

aaa cgt atg aaa gca ttg gta gag caa ggg ttc act gtt cct gaa atc 7898 
Lys Arg Met Lys Ala Leu Val Glu Gln Gly Phe Thr Val Pro Glu Ile 
2575 2580 2585 2590 

aag acc atc ctt ggg acc atg cct gcc ttt gaa gtc agt ctt cag gct 7946 
Lys Thr Ile Leu Gly Thr Met Pro Ala Phe Glu Val Ser Leu Gln Ala 
2595 2600 2605 

ctt cag aaa gct acc ttc cag aca cct gat ttt ata gtc ccc cta aca 7994 
Leu Gln Lys Ala Thr Phe Gln Thr Pro Asp Phe Ile Val Pro Leu Thr 
2610 2615 2620 

gat ttg agg att cca tca gtt cag ata aac ttc aaa gac tta aaa aat 8042 
Asp Leu Arg Ile Pro Ser Val Gln Ile Asn Phe Lys Asp Leu Lys Asn 
2625 2630 2635 
ata aaa atc cca tcc agg ttt tcc aca cca gaa ttt acc atc ctt aac 8090 
Ile Lys Ile Pro Ser Arg Phe Ser Thr Pro Glu Phe Thr Ile Leu Asn 
2640 2645 2650 

acc ttc cac att cct tcc ttt aca att gac ttt gtc gaa atg aaa gta 8138 
Thr Phe His Ile Pro Ser Phe Thr Ile Asp Phe Val Glu Met Lys Val 
2655 2660 2665 2670 

aag atc atc aga acc att gac cag atg cag aac agt gag ctg cag tgg 8186 
Lys Ile Ile Arg Thr Ile Asp Gln Met Gln Asn Ser Glu Leu Gln Trp 
2675 2680 2685 

ccc gtt cca gat ata tat ctc agg gat ctg aag gtg gag gac att cct 8234 
Pro Val Pro Asp Ile Tyr Leu Arg Asp Leu Lys Val Glu Asp Ile Pro 
2690 2695 2700 

cta gcg aga atc acc ctg cca gac ttc cgt tta cca gaa atc gca att 8282 
Leu Ala Arg Ile Thr Leu Pro Asp Phe Arg Leu Pro Glu Ile Ala Ile 
2705 2710 2715 

cca gaa ttc ata atc cca act ctc aac ctt aat gat ttt caa gtt cct 8330 
Pro Glu Phe Ile Ile Pro Thr Leu Asn Leu Asn Asp Phe Gln Val Pro 
2720 2725 2730 

gac ctt cac ata cca gaa ttc cag ctt ccc cac atc tca cac aca att 8378 
Asp Leu His Ile Pro Glu Phe Gln Leu Pro His Ile Ser His Thr Ile 
2735 2740 2745 2750 

gaa gta cct act ttt ggc aag cta tac agt att ctg aaa atc caa tct 8426 
Glu Val Pro Thr Phe Gly Lys Leu Tyr Ser Ile Leu Lys Ile Gln Ser 
2755 2760 2765 

cct ctt ttc aca tta gat gca aat gct gac ata ggg aat gga acc acc 8474 
Pro Leu Phe Thr Leu Asp Ala Asn Ala Asp Ile Gly Asn Gly Thr Thr 
2770 2775 2780 

tca gca aac gaa gca ggt atc gca gct tcc atc act gcc aaa gga gag 8522 
Ser Ala Asn Glu Ala Gly Ile Ala Ala Ser Ile Thr Ala Lys Gly Glu 
2785 2790 2795 

tcc aaa tta gaa gtt ctc aat ttt gat ttt caa gca aat gca caa ctc 8570 
Ser Lys Leu Glu Val Leu Asn Phe Asp Phe Gln Ala Asn Ala Gln Leu 
2800 2805 2810 

tca aac cct aag att aat ccg ctg gct ctg aag gag tca gtg aag ttc 8618 
Ser Asn Pro Lys Ile Asn Pro Leu Ala Leu Lys Glu Ser Val Lys Phe 
2815 2820 2825 2830 

tcc agc aag tac ctg aga acg gag cat ggg agt gaa atg ctg ttt ttt 8666 
Ser Ser Lys Tyr Leu Arg Thr Glu His Gly Ser Glu Met Leu Phe Phe 
2835 2840 2845 

gga aat gct att gag gga aaa tca aac aca gtg gca agt tta cac aca 8714 
Gly Asn Ala Ile Glu Gly Lys Ser Asn Thr Val Ala Ser Leu His Thr 
2850 2855 2860 

gaa aaa aat aca ctg gag ctt agt aat gga gtg att gtc aag ata aac 8762 
Glu Lys Asn Thr Leu Glu Leu Ser Asn Gly Val Ile Val Lys Ile Asn 
2865 2870 2875 

aat cag ctt acc ctg gat agc aac act aaa tac ttc cac aaa ttg aac 8810 
Asn Gln Leu Thr Leu Asp Ser Asn Thr Lys Tyr Phe His Lys Leu Asn 
2880 2885 2890 

atc ccc aaa ctg gac ttc tct agt cag gct gac ctg cgc aac gag atc 8858 
Ile Pro Lys Leu Asp Phe Ser Ser Gln Ala Asp Leu Arg Asn Glu Ile 
2895 2900 2905 2910 

aag aca ctg ttg aaa gct ggc cac ata gca tgg act tct tct gga aaa 8906 
Lys Thr Leu Leu Lys Ala Gly His Ile Ala Trp Thr Ser Ser Gly Lys 
2915 2920 2925 

ggg tca tgg aaa tgg gcc tgc ccc aga ttc tca gat gag gga aca cat 8954 
Gly Ser Trp Lys Trp Ala Cys Pro Arg Phe Ser Asp Glu Gly Thr His 
2930 2935 2940 

gaa tca caa att agt ttc acc ata gaa gga ccc ctc act tcc ttt gga 9002 
Glu Ser Gln Ile Ser Phe Thr Ile Glu Gly Pro Leu Thr Ser Phe Gly 
2945 2950 2955 

ctg tcc aat aag atc aat agc aaa cac cta aga gta aac caa aac ttg 9050 
Leu Ser Asn Lys Ile Asn Ser Lys His Leu Arg Val Asn Gln Asn Leu 
2960 2965 2970 

gtt tat gaa tct ggc tcc ctc aac ttt tct aaa ctt gaa att caa tca 9098 
Val Tyr Glu Ser Gly Ser Leu Asn Phe Ser Lys Leu Glu Ile Gln Ser 
2975 2980 2985 2990 

caa gtc gat tcc cag cat gtg ggc cac agt gtt cta act gct aaa ggc 9146 
Gln Val Asp Ser Gln His Val Gly His Ser Val Leu Thr Ala Lys Gly 
2995 3000 3005 

atg gca ctg ttt gga gaa ggg aag gca gag ttt act ggg agg cat gat 9194 
Met Ala Leu Phe Gly Glu Gly Lys Ala Glu Phe Thr Gly Arg His Asp 
3010 3015 3020 

gct cat tta aat gga aag gtt att gga act ttg aaa aat tct ctt ttc 9242 
Ala His Leu Asn Gly Lys Val Ile Gly Thr Leu Lys Asn Ser Leu Phe 
3025 3030 3035 

ttt tca gcc cag cca ttt gag atc acg gca tcc aca aac aat gaa ggg 9290 
Phe Ser Ala Gln Pro Phe Glu Ile Thr Ala Ser Thr Asn Asn Glu Gly 
3040 3045 3050 

aat ttg aaa gtt cgt ttt cca tta agg tta aca ggg aag ata gac ttc 9338 
Asn Leu Lys Val Arg Phe Pro Leu Arg Leu Thr Gly Lys Ile Asp Phe 
3055 3060 3065 3070 

ctg aat aac tat gca ctg ttt ctg agt ccc agt gcc cag caa gca agt 9386 
Leu Asn Asn Tyr Ala Leu Phe Leu Ser Pro Ser Ala Gln Gln Ala Ser 
3075 3080 3085 

tgg caa gta agt gct agg ttc aat cag tat aag tac aac caa aat ttc 9434 
Trp Gln Val Ser Ala Arg Phe Asn Gln Tyr Lys Tyr Asn Gln Asn Phe 
3090 3095 3100 
tct gct gga aac aac gag aac att atg gag gcc cat gta gga ata aat 9482 
Ser Ala Gly Asn Asn Glu Asn Ile Met Glu Ala His Val Gly Ile Asn 
3105 3110 3115 

gga gaa gca aat ctg gat ttc tta aac att cct tta aca att cct gaa 9530 
Gly Glu Ala Asn Leu Asp Phe Leu Asn Ile Pro Leu Thr Ile Pro Glu 
3120 3125 3130 

atg cgt cta cct tac aca ata atc aca act cct cca ctg aaa gat ttc 9578 
Met Arg Leu Pro Tyr Thr Ile Ile Thr Thr Pro Pro Leu Lys Asp Phe 
3135 3140 3145 3150 

tct cta tgg gaa aaa aca ggc ttg aag gaa ttc ttg aaa acg aca aag 9626 
Ser Leu Trp Glu Lys Thr Gly Leu Lys Glu Phe Leu Lys Thr Thr Lys 
3155 3160 3165 

caa tca ttt gat tta agt gta aaa gct cag tat aag aaa aac aaa cac 9674 
Gln Ser Phe Asp Leu Ser Val Lys Ala Gln Tyr Lys Lys Asn Lys His 
3170 3175 3180 

agg cat tcc atc aca aat cct ttg gct gtg ctt tgt gag ttt atc agt 9722 
Arg His Ser Ile Thr Asn Pro Leu Ala Val Leu Cys Glu Phe Ile Ser 
3185 3190 3195 

cag agc atc aaa tcc ttt gac agg cat ttt gaa aaa aac aga aac aat 9770 
Gln Ser Ile Lys Ser Phe Asp Arg His Phe Glu Lys Asn Arg Asn Asn 
3200 3205 3210 

gca tta gat ttt gtc acc aaa tcc tat aat gaa aca aaa att aag ttt 9818 
Ala Leu Asp Phe Val Thr Lys Ser Tyr Asn Glu Thr Lys Ile Lys Phe 
3215 3220 3225 3230 

gat aag tac aaa gct gaa aaa tct cac gac gag ctc ccc agg acc ttt 9866 
Asp Lys Tyr Lys Ala Glu Lys Ser His Asp Glu Leu Pro Arg Thr Phe 
3235 3240 3245 

caa att cct gga tac act gtt cca gtt gtc aat gtt gaa gtg tct cca 9914 
Gln Ile Pro Gly Tyr Thr Val Pro Val Val Asn Val Glu Val Ser Pro 
3250 3255 3260 

ttc acc ata gag atg tcg gca ttc ggc tat gtg ttc cca aaa gca gtc 9962 
Phe Thr Ile Glu Met Ser Ala Phe Gly Tyr Val Phe Pro Lys Ala Val 
3265 3270 3275 

agc atg cct agt ttc tcc atc cta ggt tct gac gtc cgt gtg cct tca 10010 
Ser Met Pro Ser Phe Ser Ile Leu Gly Ser Asp Val Arg Val Pro Ser 
3280 3285 3290 

tac aca tta atc ctg cca tca tta gag ctg cca gtc ctt cat gtc cct 10058 
Tyr Thr Leu Ile Leu Pro Ser Leu Glu Leu Pro Val Leu His Val Pro 
3295 3300 3305 3310 

aga aat ctc aag ctt tct ctt cca cat ttc aag gaa ttg tgt acc ata 10106 
Arg Asn Leu Lys Leu Ser Leu Pro His Phe Lys Glu Leu Cys Thr Ile 
3315 3320 3325 

agc cat att ttt att cct gcc atg ggc aat att acc tat gat ttc tcc 10154 
Ser His Ile Phe Ile Pro Ala Met Gly Asn Ile Thr Tyr Asp Phe Ser 
3330 3335 3340 

ttt aaa tca agt gtc atc aca ctg aat acc aat gct gaa ctt ttt aac 10202
Phe Lys Ser Ser Val Ile Thr Leu Asn Thr Asn Ala Glu Leu Phe Asn
3345 3350 3355

cag tca gat att gtt gct cat ctc ctt tct tca tct tca tct gtc att 10250
Gln Ser Asp Ile Val Ala His Leu Leu Ser Ser Ser Ser Ser Val Ile
3360 3365 3370

gat gca ctg cag tac aaa tta gag ggc acc aca aga ttg aca aga aaa 10298
Asp Ala Leu Gln Tyr Lys Leu Glu Gly Thr Thr Arg Leu Thr Arg Lys
3375 3380 3385 3390

agg gga ttg aag tta gcc aca gct ctg tct ctg agc aac aaa ttt gtg 10346
Arg Gly Leu Lys Leu Ala Thr Ala Leu Ser Leu Ser Asn Lys Phe Val
3395 3400 3405

gag ggt agt cat aac agt act gtg agc tta acc acg aaa aat atg gaa 10394
Glu Gly Ser His Asn Ser Thr Val Ser Leu Thr Thr Lys Asn Met Glu
3410 3415 3420

gtg tca gtg gca aaa acc aca aaa gcc gaa att cca att ttg aga atg 10442
Val Ser Val Ala Lys Thr Thr Lys Ala Glu Ile Pro Ile Leu Arg Met
3425 3430 3435

aat ttc aag caa gaa ctt aat gga aat acc aag tca aaa cct act gtc 10490
Asn Phe Lys Gln Glu Leu Asn Gly Asn Thr Lys Ser Lys Pro Thr Val
3440 3445 3450

tct tcc tcc atg gaa ttt aag tat gat ttc aat tct tca atg ctg tac 10538
Ser Ser Ser Met Glu Phe Lys Tyr Asp Phe Asn Ser Ser Met Leu Tyr
3455 3460 3465 3470

tct acc gct aaa gga gca gtt gac cac aag ctt agc ttg gaa agc ctc 10586
Ser Thr Ala Lys Gly Ala Val Asp His Lys Leu Ser Leu Glu Ser Leu
3475 3480 3485

acc tct tac ttt tcc att gag tca tct acc aaa gga gat gtc aag ggt 10634
Thr Ser Tyr Phe Ser Ile Glu Ser Ser Thr Lys Gly Asp Val Lys Gly
3490 3495 3500

tcg gtt ctt tct cgg gaa tat tca gga act att gct agt gag gcc aac 10682
Ser Val Leu Ser Arg Glu Tyr Ser Gly Thr Ile Ala Ser Glu Ala Asn
3505 3510 3515

act tac ttg aat tcc aag agc aca cgg tct tca gtg aag ctg cag ggc 10730
Thr Tyr Leu Asn Ser Lys Ser Thr Arg Ser Ser Val Lys Leu Gln Gly
3520 3525 3530

act tcc aaa att gat gat atc tgg aac ctt gaa gta aaa gaa aat ttt 10778
Thr Ser Lys Ile Asp Asp Ile Trp Asn Leu Glu Val Lys Glu Asn Phe
3535 3540 3545 3550

gct gga gaa gcc aca ctc caa cgc ata tat tcc ctc tgg gag cac agt 10826
Ala Gly Glu Ala Thr Leu Gln Arg Ile Tyr Ser Leu Trp Glu His Ser
3555 3560 3565

acg aaa aac cac tta cag cta gag ggc ctc ttt ttc acc aac gga gaa 10874
Thr Lys Asn His Leu Gln Leu Glu Gly Leu Phe Phe Thr Asn Gly Glu
3570 3575 3580

cat aca agc aaa gcc acc ctg gaa ctc tct cca tgg caa atg tca gct 10922
His Thr Ser Lys Ala Thr Leu Glu Leu Ser Pro Trp Gln Met Ser Ala
3585 3590 3595

ctt gtt cag gtc cat gca agt cag ccc agt tcc ttc cat gat ttc cct 10970
Leu Val Gln Val His Ala Ser Gln Pro Ser Ser Phe His Asp Phe Pro
3600 3605 3610

gac ctt ggc cag gaa gtg gcc ctg aat gct aac act aag aac cag aag 11018
Asp Leu Gly Gln Glu Val Ala Leu Asn Ala Asn Thr Lys Asn Gln Lys
3615 3620 3625 3630

atc aga tgg aaa aat gaa gtc cgg att cat tct ggg tct ttc cag agc 11066
Ile Arg Trp Lys Asn Glu Val Arg Ile His Ser Gly Ser Phe Gln Ser
3635 3640 3645

cag gtc gag ctt tcc aat gac caa gaa aag gca cac ctt gac att gca 11114
Gln Val Glu Leu Ser Asn Asp Gln Glu Lys Ala His Leu Asp Ile Ala
3650 3655 3660

gga tcc tta gaa gga cac cta agg ttc ctc aaa aat atc atc cta cca 11162
Gly Ser Leu Glu Gly His Leu Arg Phe Leu Lys Asn Ile Ile Leu Pro
3665 3670 3675

gtc tat gac aag agc tta tgg gat ttc cta aag ctg gat gta acc acc 11210
Val Tyr Asp Lys Ser Leu Trp Asp Phe Leu Lys Leu Asp Val Thr Thr
3680 3685 3690

agc att ggt agg aga cag cat ctt cgt gtt tca act gcc ttt gtg tac 11258
Ser Ile Gly Arg Arg Gln His Leu Arg Val Ser Thr Ala Phe Val Tyr
3695 3700 3705 3710

acc aaa aac ccc aat ggc tat tca ttc tcc atc cct gta aaa gtt ttg 11306
Thr Lys Asn Pro Asn Gly Tyr Ser Phe Ser Ile Pro Val Lys Val Leu
3715 3720 3725

gct gat aaa ttc att act cct ggg ctg aaa cta aat gat cta aat tca 11354
Ala Asp Lys Phe Ile Thr Pro Gly Leu Lys Leu Asn Asp Leu Asn Ser
3730 3735 3740

gtt ctt gtc atg cct acg ttc cat gtc cca ttt aca gat ctt cag gtt 11402
Val Leu Val Met Pro Thr Phe His Val Pro Phe Thr Asp Leu Gln Val
3745 3750 3755

cca tcg tgc aaa ctt gac ttc aga gaa ata caa atc tat aag aag ctg 11450
Pro Ser Cys Lys Leu Asp Phe Arg Glu Ile Gln Ile Tyr Lys Lys Leu
3760 3765 3770

aga act tca tca ttt gcc ctc aac cta cca aca ctc ccc gag gta aaa 11498
Arg Thr Ser Ser Phe Ala Leu Asn Leu Pro Thr Leu Pro Glu Val Lys
3775 3780 3785 3790

ttc cct gaa gtt gat gtg tta aca aaa tat tct caa cca gaa gac tcc 11546
Phe Pro Glu Val Asp Val Leu Thr Lys Tyr Ser Gln Pro Glu Asp Ser
3795 3800 3805

ttg att ccc ttt ttt gag ata acc gtg cct gaa tct cag tta act gtg 11594
Leu Ile Pro Phe Phe Glu Ile Thr Val Pro Glu Ser Gln Leu Thr Val
3810 3815 3820

tcc cag ttc acg ctt cca aaa agt gtt tca gat ggc att gct gct ttg 11642
Ser Gln Phe Thr Leu Pro Lys Ser Val Ser Asp Gly Ile Ala Ala Leu
3825 3830 3835

gat cta aat gca gta gcc aac aag atc gca gac ttt gag ttg ccc acc 11690
Asp Leu Asn Ala Val Ala Asn Lys Ile Ala Asp Phe Glu Leu Pro Thr
3840 3845 3850

atc atc gtg cct gag cag acc att gag att ccc tcc att aag ttc tct 11738
Ile Ile Val Pro Glu Gln Thr Ile Glu Ile Pro Ser Ile Lys Phe Ser
3855 3860 3865 3870

gta cct gct gga att gtc att cct tcc ttt caa gca ctg act gca cgc 11786
Val Pro Ala Gly Ile Val Ile Pro Ser Phe Gln Ala Leu Thr Ala Arg
3875 3880 3885

ttt gag gta gac tct ccc gtg tat aat gcc act tgg agt gcc agt ttg 11834
Phe Glu Val Asp Ser Pro Val Tyr Asn Ala Thr Trp Ser Ala Ser Leu
3890 3895 3900

aaa aac aaa gca gat tat gtt gaa aca gtc ctg gat tcc aca tgc agc 11882
Lys Asn Lys Ala Asp Tyr Val Glu Thr Val Leu Asp Ser Thr Cys Ser
3905 3910 3915

tca acc gta cag ttc cta gaa tat gaa cta aat gtt ttg gga aca cac 11930
Ser Thr Val Gln Phe Leu Glu Tyr Glu Leu Asn Val Leu Gly Thr His
3920 3925 3930

aaa atc gaa gat ggt acg tta gcc tct aag act aaa gga aca ctt gca 11978
Lys Ile Glu Asp Gly Thr Leu Ala Ser Lys Thr Lys Gly Thr Leu Ala
3935 3940 3945 3950

cac cgt gac ttc agt gca gaa tat gaa gaa gat ggc aaa ttt gaa gga 12026
His Arg Asp Phe Ser Ala Glu Tyr Glu Glu Asp Gly Lys Phe Glu Gly
3955 3960 3965

ctt cag gaa tgg gaa gga aaa gcg cac ctc aat atc aaa agc cca gcg 12074
Leu Gln Glu Trp Glu Gly Lys Ala His Leu Asn Ile Lys Ser Pro Ala
3970 3975 3980

ttc acc gat ctc cat ctg cgc tac cag aaa gac aag aaa ggc atc tcc 12122
Phe Thr Asp Leu His Leu Arg Tyr Gln Lys Asp Lys Lys Gly Ile Ser
3985 3990 3995

acc tca gca gcc tcc cca gcc gta ggc acc gtg ggc atg gat atg gat 12170
Thr Ser Ala Ala Ser Pro Ala Val Gly Thr Val Gly Met Asp Met Asp
4000 4005 4010

gaa gat gac gac ttt tct aaa tgg aac ttc tac tac agc cct cag tcc 12218
Glu Asp Asp Asp Phe Ser Lys Trp Asn Phe Tyr Tyr Ser Pro Gln Ser
4015 4020 4025 4030

tct cca gat aaa aaa ctc acc ata ttc aaa act gag ttg agg gtc cgg 12266
Ser Pro Asp Lys Lys Leu Thr Ile Phe Lys Thr Glu Leu Arg Val Arg
4035 4040 4045

gaa tct gat gag gaa act cag atc aaa gtt aat tgg gaa gaa gag gca 12314
Glu Ser Asp Glu Glu Thr Gln Ile Lys Val Asn Trp Glu Glu Glu Ala
4050 4055 4060

gct tct ggc ttg cta acc tct ctg aaa gac aac gtg ccc aag gcc aca 12362
Ala Ser Gly Leu Leu Thr Ser Leu Lys Asp Asn Val Pro Lys Ala Thr
4065 4070 4075

ggg gtc ctt tat gat tat gtc aac aag tac cac tgg gaa cac aca ggg 12410
Gly Val Leu Tyr Asp Tyr Val Asn Lys Tyr His Trp Glu His Thr Gly
4080 4085 4090

ctc acc ctg aga gaa gtg tct tca aag ctg aga aga aat ctg cag aac 12458
Leu Thr Leu Arg Glu Val Ser Ser Lys Leu Arg Arg Asn Leu Gln Asn
4095 4100 4105 4110

aat gct gag tgg gtt tat caa ggg gcc att agg caa att gat gat atc 12506
Asn Ala Glu Trp Val Tyr Gln Gly Ala Ile Arg Gln Ile Asp Asp Ile
4115 4120 4125

gac gtg agg ttc cag aaa gca gcc agt ggc acc act ggg acc tac caa 12554
Asp Val Arg Phe Gln Lys Ala Ala Ser Gly Thr Thr Gly Thr Tyr Gln
4130 4135 4140

gag tgg aag gac aag gcc cag aat ctg tac cag gaa ctg ttg act cag 12602
Glu Trp Lys Asp Lys Ala Gln Asn Leu Tyr Gln Glu Leu Leu Thr Gln
4145 4150 4155

gaa ggc caa gcc agt ttc cag gga ctc aag gat aac gtg ttt gat ggc 12650
Glu Gly Gln Ala Ser Phe Gln Gly Leu Lys Asp Asn Val Phe Asp Gly
4160 4165 4170

ttg gta cga gtt act caa aaa ttc cat atg aaa gtc aag cat ctg att 12698
Leu Val Arg Val Thr Gln Lys Phe His Met Lys Val Lys His Leu Ile
4175 4180 4185 4190

gac tca ctc att gat ttt ctg aac ttc ccc aga ttc cag ttt ccg ggg 12746
Asp Ser Leu Ile Asp Phe Leu Asn Phe Pro Arg Phe Gln Phe Pro Gly
4195 4200 4205

aaa cct ggg ata tac act agg gag gaa ctt tgc act atg ttc ata agg 12794
Lys Pro Gly Ile Tyr Thr Arg Glu Glu Leu Cys Thr Met Phe Ile Arg
4210 4215 4220

gag gta ggg acg gta ctg tcc cag gta tat tcg aaa gtc cat aat ggt 12842
Glu Val Gly Thr Val Leu Ser Gln Val Tyr Ser Lys Val His Asn Gly
4225 4230 4235

tca gaa ata ctg ttt tcc tat ttc caa gac cta gtg att aca ctt cct 12890
Ser Glu Ile Leu Phe Ser Tyr Phe Gln Asp Leu Val Ile Thr Leu Pro
4240 4245 4250

ttc gag tta agg aaa cat aaa cta ata gat gta atc tcg atg tat agg 12938
Phe Glu Leu Arg Lys His Lys Leu Ile Asp Val Ile Ser Met Tyr Arg
4255 4260 4265 4270

gaa ctg ttg aaa gat tta tca aaa gaa gcc caa gag gta ttt aaa gcc 12986
Glu Leu Leu Lys Asp Leu Ser Lys Glu Ala Gln Glu Val Phe Lys Ala
4275 4280 4285

att cag tct ctc aag acc aca gag gtg cta cgt aat ctt cag gac ctt 13034
Ile Gln Ser Leu Lys Thr Thr Glu Val Leu Arg Asn Leu Gln Asp Leu
4290 4295 4300

tta caa ttc att ttc caa cta ata gaa gat aac att aaa cag ctg aaa 13082
Leu Gln Phe Ile Phe Gln Leu Ile Glu Asp Asn Ile Lys Gln Leu Lys
4305 4310 4315

gag atg aaa ttt act tat ctt att aat tat atc caa gat gag atc aac 13130
Glu Met Lys Phe Thr Tyr Leu Ile Asn Tyr Ile Gln Asp Glu Ile Asn
4320 4325 4330

aca atc ttc aat gat tat atc cca tat gtt ttt aaa ttg ttg aaa gaa 13178
Thr Ile Phe Asn Asp Tyr Ile Pro Tyr Val Phe Lys Leu Leu Lys Glu
4335 4340 4345 4350

aac cta tgc ctt aat ctt cat aag ttc aat gaa ttt att caa aac gag 13226
Asn Leu Cys Leu Asn Leu His Lys Phe Asn Glu Phe Ile Gln Asn Glu
4355 4360 4365

ctt cag gaa gct tct caa gag tta cag cag atc cat caa tac att atg 13274
Leu Gln Glu Ala Ser Gln Glu Leu Gln Gln Ile His Gln Tyr Ile Met
4370 4375 4380

gcc ctt cgt gaa gaa tat ttt gat cca agt ata gtt ggc tgg aca gtg 13322
Ala Leu Arg Glu Glu Tyr Phe Asp Pro Ser Ile Val Gly Trp Thr Val
4385 4390 4395

aaa tat tat gaa ctt gaa gaa aag ata gtc agt ctg atc aag aac ctg 13370
Lys Tyr Tyr Glu Leu Glu Glu Lys Ile Val Ser Leu Ile Lys Asn Leu
4400 4405 4410

tta gtt gct ctt aag gac ttc cat tct gaa tat att gtc agt gcc tct 13418
Leu Val Ala Leu Lys Asp Phe His Ser Glu Tyr Ile Val Ser Ala Ser
4415 4420 4425 4430

aac ttt act tcc caa ctc tca agt caa gtt gag caa ttt ctg cac aga 13466
Asn Phe Thr Ser Gln Leu Ser Ser Gln Val Glu Gln Phe Leu His Arg
4435 4440 4445

aat att cag gaa tat ctt agc atc ctt acc gat cca gat gga aaa ggg 13514
Asn Ile Gln Glu Tyr Leu Ser Ile Leu Thr Asp Pro Asp Gly Lys Gly
4450 4455 4460

aaa gag aag att gca gag ctt tct gcc act gct cag gaa ata att aaa 13562
Lys Glu Lys Ile Ala Glu Leu Ser Ala Thr Ala Gln Glu Ile Ile Lys
4465 4470 4475

agc cag gcc att gcg acg aag aaa ata att tct gat tac cac cag cag 13610
Ser Gln Ala Ile Ala Thr Lys Lys Ile Ile Ser Asp Tyr His Gln Gln
4480 4485 4490

ttt aga tat aaa ctg caa gat ttt tca gac caa ctc tct gat tac tat 13658
Phe Arg Tyr Lys Leu Gln Asp Phe Ser Asp Gln Leu Ser Asp Tyr Tyr
4495 4500 4505 4510

gaa aaa ttt att gct gaa tcc aaa aga ttg att gac ctg tcc att caa 13706
Glu Lys Phe Ile Ala Glu Ser Lys Arg Leu Ile Asp Leu Ser Ile Gln
4515 4520 4525

aac tac cac aca ttt ctg ata tac atc acg gag tta ctg aaa aag ctg 13754
Asn Tyr His Thr Phe Leu Ile Tyr Ile Thr Glu Leu Leu Lys Lys Leu
4530 4535 4540

caa tca acc aca gtc atg aac ccc tac atg aag ctt gct cca gga gaa 13802
Gln Ser Thr Thr Val Met Asn Pro Tyr Met Lys Leu Ala Pro Gly Glu
4545 4550 4555

ctt act atc atc ctc taa ttttttaaaa gaaatcttca tttattcttc 13850
Leu Thr Ile Ile Leu *
4560

ttttccaatt gaactttcac atagcacaga aaaaattcaa actgcctata ttgataaaac 13910

catacagtga gccagccttg cagtaggcag tagactataa gcagaagcac atatgaactg 13970

gacctgcacc aaagctggca ccagggctcg gaaggtctct gaactcagaa ggatggcatt 14030

ttttgcaagt taaagaaaat caggatctga gttattttgc taaacttggg ggaggaggaa 14090

caaataaatg gagtctttat tgtgtatcat a 14121


<210> 32 

<211> 4563 

<212> PRT 

<213> Homo sapien 


<400> 32 


Met Asp Pro Pro Arg Pro Ala Leu Leu Ala Leu Leu Ala Leu Pro Ala
1 5 10 15

Leu Leu Leu Leu Leu Leu Ala Gly Ala Arg Ala Glu Glu Glu Met Leu
20 25 30

Glu Asn Val Ser Leu Val Cys Pro Lys Asp Ala Thr Arg Phe Lys His
35 40 45

Leu Arg Lys Tyr Thr Tyr Asn Tyr Glu Ala Glu Ser Ser Ser Gly Val
50 55 60

Pro Gly Thr Ala Asp Ser Arg Ser Ala Thr Arg Ile Asn Cys Lys Val
65 70 75 80

Glu Leu Glu Val Pro Gln Leu Cys Ser Phe Ile Leu Lys Thr Ser Gln
85 90 95

Cys Thr Leu Lys Glu Val Tyr Gly Phe Asn Pro Glu Gly Lys Ala Leu
100 105 110

Leu Lys Lys Thr Lys Asn Ser Glu Glu Phe Ala Ala Ala Met Ser Arg
115 120 125

Tyr Glu Leu Lys Leu Ala Ile Pro Glu Gly Lys Gln Val Phe Leu Tyr
130 135 140

Pro Glu Lys Asp Glu Pro Thr Tyr Ile Leu Asn Ile Lys Arg Gly Ile
145 150 155 160

Ile Ser Ala Leu Leu Val Pro Pro Glu Thr Glu Glu Ala Lys Gln Val
165 170 175

Leu Phe Leu Asp Thr Val Tyr Gly Asn Cys Ser Thr His Phe Thr Val
180 185 190

Lys Thr Arg Lys Gly Asn Val Ala Thr Glu Ile Ser Thr Glu Arg Asp
195 200 205

Leu Gly Gln Cys Asp Arg Phe Lys Pro Ile Arg Thr Gly Ile Ser Pro
210 215 220

Leu Ala Leu Ile Lys Gly Met Thr Arg Pro Leu Ser Thr Leu Ile Ser
225 230 235 240

Ser Ser Gln Ser Cys Gln Tyr Thr Leu Asp Ala Lys Arg Lys His Val
245 250 255

Ala Glu Ala Ile Cys Lys Glu Gln His Leu Phe Leu Pro Phe Ser Tyr
260 265 270

Asn Asn Lys Tyr Gly Met Val Ala Gln Val Thr Gln Thr Leu Lys Leu
275 280 285

Glu Asp Thr Pro Lys Ile Asn Ser Arg Phe Phe Gly Glu Gly Thr Lys
290 295 300

Lys Met Gly Leu Ala Phe Glu Ser Thr Lys Ser Thr Ser Pro Pro Lys
305 310 315 320

Gln Ala Glu Ala Val Leu Lys Thr Leu Gln Glu Leu Lys Lys Leu Thr
325 330 335

Ile Ser Glu Gln Asn Ile Gln Arg Ala Asn Leu Phe Asn Lys Leu Val
340 345 350

Thr Glu Leu Arg Gly Leu Ser Asp Glu Ala Val Thr Ser Leu Leu Pro
355 360 365

Gln Leu Ile Glu Val Ser Ser Pro Ile Thr Leu Gln Ala Leu Val Gln
370 375 380

Cys Gly Gln Pro Gln Cys Ser Thr His Ile Leu Gln Trp Leu Lys Arg
385 390 395 400

Val His Ala Asn Pro Leu Leu Ile Asp Val Val Thr Tyr Leu Val Ala
405 410 415

Leu Ile Pro Glu Pro Ser Ala Gln Gln Leu Arg Glu Ile Phe Asn Met
420 425 430

Ala Arg Asp Gln Arg Ser Arg Ala Thr Leu Tyr Ala Leu Ser His Ala
435 440 445

Val Asn Asn Tyr His Lys Thr Asn Pro Thr Gly Thr Gln Glu Leu Leu
450 455 460

Asp Ile Ala Asn Tyr Leu Met Glu Gln Ile Gln Asp Asp Cys Thr Gly
465 470 475 480

Asp Glu Asp Tyr Thr Tyr Leu Ile Leu Arg Val Ile Gly Asn Met Gly
485 490 495

Gln Thr Met Glu Gln Leu Thr Pro Glu Leu Lys Ser Ser Ile Leu Lys
500 505 510

Cys Val Gln Ser Thr Lys Pro Ser Leu Met Ile Gln Lys Ala Ala Ile
515 520 525

Gln Ala Leu Arg Lys Met Glu Pro Lys Asp Lys Asp Gln Glu Val Leu
530 535 540

Leu Gln Thr Phe Leu Asp Asp Ala Ser Pro Gly Asp Lys Arg Leu Ala
545 550 555 560

Ala Tyr Leu Met Leu Met Arg Ser Pro Ser Gln Ala Asp Ile Asn Lys
565 570 575

Ile Val Gln Ile Leu Pro Trp Glu Gln Asn Glu Gln Val Lys Asn Phe
580 585 590

Val Ala Ser His Ile Ala Asn Ile Leu Asn Ser Glu Glu Leu Asp Ile
595 600 605

Gln Asp Leu Lys Lys Leu Val Lys Glu Ala Leu Lys Glu Ser Gln Leu
610 615 620

Pro Thr Val Met Asp Phe Arg Lys Phe Ser Arg Asn Tyr Gln Leu Tyr
625 630 635 640

Lys Ser Val Ser Leu Pro Ser Leu Asp Pro Ala Ser Ala Lys Ile Glu
645 650 655

Gly Asn Leu Ile Phe Asp Pro Asn Asn Tyr Leu Pro Lys Glu Ser Met
660 665 670

Leu Lys Thr Thr Leu Thr Ala Phe Gly Phe Ala Ser Ala Asp Leu Ile
675 680 685

Glu Ile Gly Leu Glu Gly Lys Gly Phe Glu Pro Thr Leu Glu Ala Leu
690 695 700

Phe Gly Lys Gln Gly Phe Phe Pro Asp Ser Val Asn Lys Ala Leu Tyr
705 710 715 720

Trp Val Asn Gly Gln Val Pro Asp Gly Val Ser Lys Val Leu Val Asp
725 730 735

His Phe Gly Tyr Thr Lys Asp Asp Lys His Glu Gln Asp Met Val Asn
740 745 750

Gly Ile Met Leu Ser Val Glu Lys Leu Ile Lys Asp Leu Lys Ser Lys
755 760 765

Glu Val Pro Glu Ala Arg Ala Tyr Leu Arg Ile Leu Gly Glu Glu Leu
770 775 780

Gly Phe Ala Ser Leu His Asp Leu Gln Leu Leu Gly Lys Leu Leu Leu
785 790 795 800

Met Gly Ala Arg Thr Leu Gln Gly Ile Pro Gln Met Ile Gly Glu Val
805 810 815

Ile Arg Lys Gly Ser Lys Asn Asp Phe Phe Leu His Tyr Ile Phe Met
820 825 830

Glu Asn Ala Phe Glu Leu Pro Thr Gly Ala Gly Leu Gln Leu Gln Ile
835 840 845

Ser Ser Ser Gly Val Ile Ala Pro Gly Ala Lys Ala Gly Val Lys Leu
850 855 860

Glu Val Ala Asn Met Gln Ala Glu Leu Val Ala Lys Pro Ser Val Ser
865 870 875 880

Val Glu Phe Val Thr Asn Met Gly Ile Ile Ile Pro Asp Phe Ala Arg
885 890 895

Ser Gly Val Gln Met Asn Thr Asn Phe Phe His Glu Ser Gly Leu Glu
900 905 910

Ala His Val Ala Leu Lys Ala Gly Lys Leu Lys Phe Ile Ile Pro Ser
915 920 925

Pro Lys Arg Pro Val Lys Leu Leu Ser Gly Gly Asn Thr Leu His Leu
930 935 940

Val Ser Thr Thr Lys Thr Glu Val Ile Pro Pro Leu Ile Glu Asn Arg
945 950 955 960

Gln Ser Trp Ser Val Cys Lys Gln Val Phe Pro Gly Leu Asn Tyr Cys
965 970 975

Thr Ser Gly Ala Tyr Ser Asn Ala Ser Ser Thr Asp Ser Ala Ser Tyr
980 985 990

Tyr Pro Leu Thr Gly Asp Thr Arg Leu Glu Leu Glu Leu Arg Pro Thr
995 1000 1005

Gly Glu Ile Glu Gln Tyr Ser Val Ser Ala Thr Tyr Glu Leu Gln Arg
1010 1015 1020

Glu Asp Arg Ala Leu Val Asp Thr Leu Lys Phe Val Thr Gln Ala Glu
1025 1030 1035 1040

Gly Ala Lys Gln Thr Glu Ala Thr Met Thr Phe Lys Tyr Asn Arg Gln
1045 1050 1055

Ser Met Thr Leu Ser Ser Glu Val Gln Ile Pro Asp Phe Asp Val Asp
1060 1065 1070

Leu Gly Thr Ile Leu Arg Val Asn Asp Glu Ser Thr Glu Gly Lys Thr
1075 1080 1085

Ser Tyr Arg Leu Thr Leu Asp Ile Gln Asn Lys Lys Ile Thr Glu Val
1090 1095 1100

Ala Leu Met Gly His Leu Ser Cys Asp Thr Lys Glu Glu Arg Lys Ile
1105 1110 1115 1120

Lys Gly Val Ile Ser Ile Pro Arg Leu Gln Ala Glu Ala Arg Ser Glu
1125 1130 1135

Ile Leu Ala His Trp Ser Pro Ala Lys Leu Leu Leu Gln Met Asp Ser
1140 1145 1150

Ser Ala Thr Ala Tyr Gly Ser Thr Val Ser Lys Arg Val Ala Trp His
1155 1160 1165

Tyr Asp Glu Glu Lys Ile Glu Phe Glu Trp Asn Thr Gly Thr Asn Val
1170 1175 1180

Asp Thr Lys Lys Met Thr Ser Asn Phe Pro Val Asp Leu Ser Asp Tyr
1185 1190 1195 1200

Pro Lys Ser Leu His Met Tyr Ala Asn Arg Leu Leu Asp His Arg Val
1205 1210 1215

Pro Glu Thr Asp Met Thr Phe Arg His Val Gly Ser Lys Leu Ile Val
1220 1225 1230

Ala Met Ser Ser Trp Leu Gln Lys Ala Ser Gly Ser Leu Pro Tyr Thr
1235 1240 1245

Gln Thr Leu Gln Asp His Leu Asn Ser Leu Lys Glu Phe Asn Leu Gln
1250 1255 1260

Asn Met Gly Leu Pro Asp Phe His Ile Pro Glu Asn Leu Phe Leu Lys
1265 1270 1275 1280

Ser Asp Gly Arg Val Lys Tyr Thr Leu Asn Lys Asn Ser Leu Lys Ile
1285 1290 1295

Glu Ile Pro Leu Pro Phe Gly Gly Lys Ser Ser Arg Asp Leu Lys Met
1300 1305 1310

Leu Glu Thr Val Arg Thr Pro Ala Leu His Phe Lys Ser Val Gly Phe
1315 1320 1325

His Leu Pro Ser Arg Glu Phe Gln Val Pro Thr Phe Thr Ile Pro Lys
1330 1335 1340

Leu Tyr Gln Leu Gln Val Pro Leu Leu Gly Val Leu Asp Leu Ser Thr
1345 1350 1355 1360

Asn Val Tyr Ser Asn Leu Tyr Asn Trp Ser Ala Ser Tyr Ser Gly Gly
1365 1370 1375

Asn Thr Ser Thr Asp His Phe Ser Leu Arg Ala Arg Tyr His Met Lys
1380 1385 1390

Ala Asp Ser Val Val Asp Leu Leu Ser Tyr Asn Val Gln Gly Ser Gly
1395 1400 1405

Glu Thr Thr Tyr Asp His Lys Asn Thr Phe Thr Leu Ser Cys Asp Gly
1410 1415 1420

Ser Leu Arg His Lys Phe Leu Asp Ser Asn Ile Lys Phe Ser His Val
1425 1430 1435 1440

Glu Lys Leu Gly Asn Asn Pro Val Ser Lys Gly Leu Leu Ile Phe Asp
1445 1450 1455

Ala Ser Ser Ser Trp Gly Pro Gln Met Ser Ala Ser Val His Leu Asp
1460 1465 1470

Ser Lys Lys Lys Gln His Leu Phe Val Lys Glu Val Lys Ile Asp Gly
1475 1480 1485

Gln Phe Arg Val Ser Ser Phe Tyr Ala Lys Gly Thr Tyr Gly Leu Ser
1490 1495 1500

Cys Gln Arg Asp Pro Asn Thr Gly Arg Leu Asn Gly Glu Ser Asn Leu
1505 1510 1515 1520

Arg Phe Asn Ser Ser Tyr Leu Gln Gly Thr Asn Gln Ile Thr Gly Arg
1525 1530 1535

Tyr Glu Asp Gly Thr Leu Ser Leu Thr Ser Thr Ser Asp Leu Gln Ser
1540 1545 1550

Gly Ile Ile Lys Asn Thr Ala Ser Leu Lys Tyr Glu Asn Tyr Glu Leu
1555 1560 1565

Thr Leu Lys Ser Asp Thr Asn Gly Lys Tyr Lys Asn Phe Ala Thr Ser
1570 1575 1580

Asn Lys Met Asp Met Thr Phe Ser Lys Gln Asn Ala Leu Leu Arg Ser
1585 1590 1595 1600

Glu Tyr Gln Ala Asp Tyr Glu Ser Leu Arg Phe Phe Ser Leu Leu Ser
1605 1610 1615

Gly Ser Leu Asn Ser His Gly Leu Glu Leu Asn Ala Asp Ile Leu Gly
1620 1625 1630

Thr Asp Lys Ile Asn Ser Gly Ala His Lys Ala Thr Leu Arg Ile Gly
1635 1640 1645

Gln Asp Gly Ile Ser Thr Ser Ala Thr Thr Asn Leu Lys Cys Ser Leu
1650 1655 1660

Leu Val Leu Glu Asn Glu Leu Asn Ala Glu Leu Gly Leu Ser Gly Ala
1665 1670 1675 1680

Ser Met Lys Leu Thr Thr Asn Gly Arg Phe Arg Glu His Asn Ala Lys
1685 1690 1695

Phe Ser Leu Asp Gly Lys Ala Ala Leu Thr Glu Leu Ser Leu Gly Ser
1700 1705 1710

Ala Tyr Gln Ala Met Ile Leu Gly Val Asp Ser Lys Asn Ile Phe Asn
1715 1720 1725

Phe Lys Val Ser Gln Glu Gly Leu Lys Leu Ser Asn Asp Met Met Gly
1730 1735 1740

Ser Tyr Ala Glu Met Lys Phe Asp His Thr Asn Ser Leu Asn Ile Ala
1745 1750 1755 1760

Gly Leu Ser Leu Asp Phe Ser Ser Lys Leu Asp Asn Ile Tyr Ser Ser
1765 1770 1775

Asp Lys Phe Tyr Lys Gln Thr Val Asn Leu Gln Leu Gln Pro Tyr Ser
1780 1785 1790

Leu Val Thr Thr Leu Asn Ser Asp Leu Lys Tyr Asn Ala Leu Asp Leu
1795 1800 1805

Thr Asn Asn Gly Lys Leu Arg Leu Glu Pro Leu Lys Leu His Val Ala
1810 1815 1820

Gly Asn Leu Lys Gly Ala Tyr Gln Asn Asn Glu Ile Lys His Ile Tyr
1825 1830 1835 1840

Ala Ile Ser Ser Ala Ala Leu Ser Ala Ser Tyr Lys Ala Asp Thr Val
1845 1850 1855

Ala Lys Val Gln Gly Val Glu Phe Ser His Arg Leu Asn Thr Asp Ile
1860 1865 1870

Ala Gly Leu Ala Ser Ala Ile Asp Met Ser Thr Asn Tyr Asn Ser Asp
1875 1880 1885

Ser Leu His Phe Ser Asn Val Phe Arg Ser Val Met Ala Pro Phe Thr
1890 1895 1900

Met Thr Ile Asp Ala His Thr Asn Gly Asn Gly Lys Leu Ala Leu Trp
1905 1910 1915 1920

Gly Glu His Thr Gly Gln Leu Tyr Ser Lys Phe Leu Leu Lys Ala Glu
1925 1930 1935

Pro Leu Ala Phe Thr Phe Ser His Asp Tyr Lys Gly Ser Thr Ser His
1940 1945 1950

His Leu Val Ser Arg Lys Ser Ile Ser Ala Ala Leu Glu His Lys Val
1955 1960 1965

Ser Ala Leu Leu Thr Pro Ala Glu Gln Thr Gly Thr Trp Lys Leu Lys
1970 1975 1980

Thr Gln Phe Asn Asn Asn Glu Tyr Ser Gln Asp Leu Asp Ala Tyr Asn
1985 1990 1995 2000

Thr Lys Asp Lys Ile Gly Val Glu Leu Thr Gly Arg Thr Leu Ala Asp
2005 2010 2015

Leu Thr Leu Leu Asp Ser Pro Ile Lys Val Pro Leu Leu Leu Ser Glu
2020 2025 2030

Pro Ile Asn Ile Ile Asp Ala Leu Glu Met Arg Asp Ala Val Glu Lys
2035 2040 2045

Pro Gln Glu Phe Thr Ile Val Ala Phe Val Lys Tyr Asp Lys Asn Gln
2050 2055 2060




Asp Val His Ser Ile Asn Leu Pro Phe Phe Glu Thr Leu Gln Glu Tyr
2065 2070 2075 2080

Phe Glu Arg Asn Arg Gln Thr Ile Ile Val Val Val Glu Asn Val Gln
2085 2090 2095

Arg Asn Leu Lys His Ile Asn Ile Asp Gln Phe Val Arg Lys Tyr Arg
2100 2105 2110

Ala Ala Leu Gly Lys Leu Pro Gln Gln Ala Asn Asp Tyr Leu Asn Ser
2115 2120 2125

Phe Asn Trp Glu Arg Gln Val Ser His Ala Lys Glu Lys Leu Thr Ala
2130 2135 2140

Leu Thr Lys Lys Tyr Arg Ile Thr Glu Asn Asp Ile Gln Ile Ala Leu
2145 2150 2155 2160

Asp Asp Ala Lys Ile Asn Phe Asn Glu Lys Leu Ser Gln Leu Gln Thr
2165 2170 2175

Tyr Met Ile Gln Phe Asp Gln Tyr Ile Lys Asp Ser Tyr Asp Leu His
2180 2185 2190

Asp Leu Lys Ile Ala Ile Ala Asn Ile Ile Asp Glu Ile Ile Glu Lys
2195 2200 2205

Leu Lys Ser Leu Asp Glu His Tyr His Ile Arg Val Asn Leu Val Lys
2210 2215 2220

Thr Ile His Asp Leu His Leu Phe Ile Glu Asn Ile Asp Phe Asn Lys
2225 2230 2235 2240

Ser Gly Ser Ser Thr Ala Ser Trp Ile Gln Asn Val Asp Thr Lys Tyr
2245 2250 2255

Gln Ile Arg Ile Gln Ile Gln Glu Lys Leu Gln Gln Leu Lys Arg His
2260 2265 2270

Ile Gln Asn Ile Asp Ile Gln His Leu Ala Gly Lys Leu Lys Gln His
2275 2280 2285

Ile Glu Ala Ile Asp Val Arg Val Leu Leu Asp Gln Leu Gly Thr Thr
2290 2295 2300

Ile Ser Phe Glu Arg Ile Asn Asp Val Leu Glu His Val Lys His Phe
2305 2310 2315 2320

Val Ile Asn Leu Ile Gly Asp Phe Glu Val Ala Glu Lys Ile Asn Ala
2325 2330 2335

Phe Arg Ala Lys Val His Glu Leu Ile Glu Arg Tyr Glu Val Asp Gln
2340 2345 2350

Gln Ile Gln Val Leu Met Asp Lys Leu Val Glu Leu Thr His Gln Tyr
2355 2360 2365

Lys Leu Lys Glu Thr Ile Gln Lys Leu Ser Asn Val Leu Gln Gln Val
2370 2375 2380

Lys Ile Lys Asp Tyr Phe Glu Lys Leu Val Gly Phe Ile Asp Asp Ala
2385 2390 2395 2400

Val Lys Lys Leu Asn Glu Leu Ser Phe Lys Thr Phe Ile Glu Asp Val
2405 2410 2415

Asn Lys Phe Leu Asp Met Leu Ile Lys Lys Leu Lys Ser Phe Asp Tyr
2420 2425 2430

His Gln Phe Val Asp Glu Thr Asn Asp Lys Ile Arg Glu Val Thr Gln
2435 2440 2445

Arg Leu Asn Gly Glu Ile Gln Ala Leu Glu Leu Pro Gln Lys Ala Glu
2450 2455 2460

Ala Leu Lys Leu Phe Leu Glu Glu Thr Lys Ala Thr Val Ala Val Tyr
2465 2470 2475 2480

Leu Glu Ser Leu Gln Asp Thr Lys Ile Thr Leu Ile Ile Asn Trp Leu
2485 2490 2495

Gln Glu Ala Leu Ser Ser Ala Ser Leu Ala His Met Lys Ala Lys Phe
2500 2505 2510

Arg Glu Thr Leu Glu Asp Thr Arg Asp Arg Met Tyr Gln Met Asp Ile
2515 2520 2525

Gln Gln Glu Leu Gln Arg Tyr Leu Ser Leu Val Gly Gln Val Tyr Ser
2530 2535 2540

Thr Leu Val Thr Tyr Ile Ser Asp Trp Trp Thr Leu Ala Ala Lys Asn
2545 2550 2555 2560

Leu Thr Asp Phe Ala Glu Gln Tyr Ser Ile Gln Asp Trp Ala Lys Arg
2565 2570 2575

Met Lys Ala Leu Val Glu Gln Gly Phe Thr Val Pro Glu Ile Lys Thr
2580 2585 2590

Ile Leu Gly Thr Met Pro Ala Phe Glu Val Ser Leu Gln Ala Leu Gln
2595 2600 2605

Lys Ala Thr Phe Gln Thr Pro Asp Phe Ile Val Pro Leu Thr Asp Leu
2610 2615 2620

Arg Ile Pro Ser Val Gln Ile Asn Phe Lys Asp Leu Lys Asn Ile Lys
2625 2630 2635 2640

Ile Pro Ser Arg Phe Ser Thr Pro Glu Phe Thr Ile Leu Asn Thr Phe
2645 2650 2655

His Ile Pro Ser Phe Thr Ile Asp Phe Val Glu Met Lys Val Lys Ile
2660 2665 2670

Ile Arg Thr Ile Asp Gln Met Gln Asn Ser Glu Leu Gln Trp Pro Val
2675 2680 2685

Pro Asp Ile Tyr Leu Arg Asp Leu Lys Val Glu Asp Ile Pro Leu Ala
2690 2695 2700

Arg Ile Thr Leu Pro Asp Phe Arg Leu Pro Glu Ile Ala Ile Pro Glu
2705 2710 2715 2720

Phe Ile Ile Pro Thr Leu Asn Leu Asn Asp Phe Gln Val Pro Asp Leu
2725 2730 2735

His Ile Pro Glu Phe Gln Leu Pro His Ile Ser His Thr Ile Glu Val
2740 2745 2750

Pro Thr Phe Gly Lys Leu Tyr Ser Ile Leu Lys Ile Gln Ser Pro Leu
2755 2760 2765

Phe Thr Leu Asp Ala Asn Ala Asp Ile Gly Asn Gly Thr Thr Ser Ala
2770 2775 2780

Asn Glu Ala Gly Ile Ala Ala Ser Ile Thr Ala Lys Gly Glu Ser Lys
2785 2790 2795 2800

Leu Glu Val Leu Asn Phe Asp Phe Gln Ala Asn Ala Gln Leu Ser Asn
2805 2810 2815

Pro Lys Ile Asn Pro Leu Ala Leu Lys Glu Ser Val Lys Phe Ser Ser
2820 2825 2830

Lys Tyr Leu Arg Thr Glu His Gly Ser Glu Met Leu Phe Phe Gly Asn
2835 2840 2845

Ala Ile Glu Gly Lys Ser Asn Thr Val Ala Ser Leu His Thr Glu Lys
2850 2855 2860

Asn Thr Leu Glu Leu Ser Asn Gly Val Ile Val Lys Ile Asn Asn Gln
2865 2870 2875 2880

Leu Thr Leu Asp Ser Asn Thr Lys Tyr Phe His Lys Leu Asn Ile Pro
2885 2890 2895

Lys Leu Asp Phe Ser Ser Gln Ala Asp Leu Arg Asn Glu Ile Lys Thr
2900 2905 2910

Leu Leu Lys Ala Gly His Ile Ala Trp Thr Ser Ser Gly Lys Gly Ser
2915 2920 2925

Trp Lys Trp Ala Cys Pro Arg Phe Ser Asp Glu Gly Thr His Glu Ser
2930 2935 2940

Gln Ile Ser Phe Thr Ile Glu Gly Pro Leu Thr Ser Phe Gly Leu Ser
2945 2950 2955 2960

Asn Lys Ile Asn Ser Lys His Leu Arg Val Asn Gln Asn Leu Val Tyr
2965 2970 2975

Glu Ser Gly Ser Leu Asn Phe Ser Lys Leu Glu Ile Gln Ser Gln Val
2980 2985 2990

Asp Ser Gln His Val Gly His Ser Val Leu Thr Ala Lys Gly Met Ala
2995 3000 3005

Leu Phe Gly Glu Gly Lys Ala Glu Phe Thr Gly Arg His Asp Ala His 

3010 3015 3020 

Leu Asn Gly Lys Val lie Gly Thr Leu Lys Asn Ser Leu Phe Phe Ser 
3025 3030 3035 3040 

Ala Gin Pro Phe Glu lie Thr Ala Ser Thr Asn Asn Glu Gly Asn Leu 

3045 3050 3055 

Lys Val Arg Phe Pro Leu Arg Leu Thr Gly Lys lie Asp Phe Leu Asn 

3060 3065 3070 

Asn Tyr Ala Leu Phe Leu Ser Pro Ser Ala Gin Gin Ala Ser Trp Gin 

3075 3080 3085 

Val Ser Ala Arg Phe Asn Gin Tyr Lys Tyr Asn Gin Asn Phe Ser Ala 

3090 3095 3100 

Gly Asn Asn Glu Asn lie Met Glu Ala His Val Gly lie Asn Gly Glu 
3105 3110 3115 3120 

Ala Asn Leu Asp Phe Leu Asn lie Pro Leu Thr lie Pro Glu Met Arg 

3125 3130 3135 

Leu Pro Tyr Thr lie lie Thr Thr Pro Pro Leu Lys Asp Phe Ser Leu 

3140 3145 3150 

Trp Glu Lys Thr Gly Leu Lys Glu Phe Leu Lys Thr Thr Lys Gin Ser 

3155 3160 3165 

Phe Asp Leu Ser Val Lys Ala Gin Tyr Lys Lys Asn Lys His Arg His 

3170 3175 3180 

Ser lie Thr Asn Pro Leu Ala Val Leu Cys Glu Phe lie Ser Gin Ser 
3185 3190 3195 3200 

lie Lys Ser Phe Asp Arg His Phe Glu Lys Asn Arg Asn Asn Ala Leu 

3205 3210 3215 

Asp Phe Val Thr Lys Ser Tyr Asn Glu Thr Lys lie Lys Phe Asp Lys 

3220 3225 3230 

Tyr Lys Ala Glu Lys Ser His Asp Glu Leu Pro Arg Thr Phe Gin lie 

3235 3240 3245 

Pro Gly Tyr Thr Val Pro Val Val Asn Val Glu Val Ser Pro Phe Thr 

3250 3255 3260 

He Glu Met Ser Ala Phe Gly Tyr Val Phe Pro Lys Ala Val Ser Met 
3265 3270 3275 3280 

Pro Ser Phe Ser lie Leu Gly Ser Asp Val Arg Val Pro Ser Tyr Thr 

3285 3290 3295 

Leu lie Leu Pro Ser Leu Glu Leu Pro Val Leu His Val Pro Arg Asn 

3300 3305 3310 

Leu Lys Leu Ser Leu Pro His Phe Lys Glu Leu Cys Thr He Ser His 

3315 3320 3325 

He Phe He Pro Ala Met Gly Asn He Thr Tyr Asp Phe Ser Phe Lys 

3330 3335 3340 

Ser Ser Val He Thr Leu Asn Thr Asn Ala Glu Leu Phe Asn Gin Ser 
3345 3350 3355 3360 

Asp He Val Ala His Leu Leu Ser Ser Ser Ser Ser Val He Asp Ala 

3365 3370 3375 

Leu Gin Tyr Lys Leu Glu Gly Thr Thr Arg Leu Thr Arg Lys Arg Gly 

3380 3385 3390 

Leu Lys Leu Ala Thr Ala Leu Ser Leu Ser Asn Lys Phe Val Glu Gly 

3395 3400 3405 

Ser His Asn Ser Thr Val Ser Leu Thr Thr Lys Asn Met Glu Val Ser 

3410 3415 3420 

Val Ala Lys Thr Thr Lys Ala Glu He Pro He Leu Arg Met Asn Phe 
3425 3430 3435 3440 

Lys Gin Glu Leu Asn Gly Asn Thr Lys Ser Lys Pro Thr Val Ser Ser 
3445 3450 3455

Ser Met Glu Phe Lys Tyr Asp Phe Asn Ser Ser Met Leu Tyr Ser Thr
3460 3465 3470

Ala Lys Gly Ala Val Asp His Lys Leu Ser Leu Glu Ser Leu Thr Ser
3475 3480 3485

Tyr Phe Ser Ile Glu Ser Ser Thr Lys Gly Asp Val Lys Gly Ser Val
3490 3495 3500

Leu Ser Arg Glu Tyr Ser Gly Thr Ile Ala Ser Glu Ala Asn Thr Tyr
3505 3510 3515 3520

Leu Asn Ser Lys Ser Thr Arg Ser Ser Val Lys Leu Gln Gly Thr Ser
3525 3530 3535

Lys Ile Asp Asp Ile Trp Asn Leu Glu Val Lys Glu Asn Phe Ala Gly
3540 3545 3550

Glu Ala Thr Leu Gln Arg Ile Tyr Ser Leu Trp Glu His Ser Thr Lys
3555 3560 3565

Asn His Leu Gln Leu Glu Gly Leu Phe Phe Thr Asn Gly Glu His Thr
3570 3575 3580

Ser Lys Ala Thr Leu Glu Leu Ser Pro Trp Gln Met Ser Ala Leu Val
3585 3590 3595 3600

Gln Val His Ala Ser Gln Pro Ser Ser Phe His Asp Phe Pro Asp Leu
3605 3610 3615

Gly Gln Glu Val Ala Leu Asn Ala Asn Thr Lys Asn Gln Lys Ile Arg
3620 3625 3630

Trp Lys Asn Glu Val Arg Ile His Ser Gly Ser Phe Gln Ser Gln Val
3635 3640 3645

Glu Leu Ser Asn Asp Gln Glu Lys Ala His Leu Asp Ile Ala Gly Ser
3650 3655 3660

Leu Glu Gly His Leu Arg Phe Leu Lys Asn Ile Ile Leu Pro Val Tyr
3665 3670 3675 3680

Asp Lys Ser Leu Trp Asp Phe Leu Lys Leu Asp Val Thr Thr Ser Ile
3685 3690 3695

Gly Arg Arg Gln His Leu Arg Val Ser Thr Ala Phe Val Tyr Thr Lys
3700 3705 3710

Asn Pro Asn Gly Tyr Ser Phe Ser Ile Pro Val Lys Val Leu Ala Asp
3715 3720 3725

Lys Phe Ile Thr Pro Gly Leu Lys Leu Asn Asp Leu Asn Ser Val Leu
3730 3735 3740

Val Met Pro Thr Phe His Val Pro Phe Thr Asp Leu Gln Val Pro Ser
3745 3750 3755 3760

Cys Lys Leu Asp Phe Arg Glu Ile Gln Ile Tyr Lys Lys Leu Arg Thr
3765 3770 3775

Ser Ser Phe Ala Leu Asn Leu Pro Thr Leu Pro Glu Val Lys Phe Pro
3780 3785 3790

Glu Val Asp Val Leu Thr Lys Tyr Ser Gln Pro Glu Asp Ser Leu Ile
3795 3800 3805

Pro Phe Phe Glu Ile Thr Val Pro Glu Ser Gln Leu Thr Val Ser Gln
3810 3815 3820

Phe Thr Leu Pro Lys Ser Val Ser Asp Gly Ile Ala Ala Leu Asp Leu
3825 3830 3835 3840

Asn Ala Val Ala Asn Lys Ile Ala Asp Phe Glu Leu Pro Thr Ile Ile
3845 3850 3855

Val Pro Glu Gln Thr Ile Glu Ile Pro Ser Ile Lys Phe Ser Val Pro
3860 3865 3870

Ala Gly Ile Val Ile Pro Ser Phe Gln Ala Leu Thr Ala Arg Phe Glu
3875 3880 3885

Val Asp Ser Pro Val Tyr Asn Ala Thr Trp Ser Ala Ser Leu Lys Asn
3890 3895 3900

Lys Ala Asp Tyr Val Glu Thr Val Leu Asp Ser Thr Cys Ser Ser Thr
3905 3910 3915 3920

Val Gln Phe Leu Glu Tyr Glu Leu Asn Val Leu Gly Thr His Lys Ile
3925 3930 3935

Glu Asp Gly Thr Leu Ala Ser Lys Thr Lys Gly Thr Leu Ala His Arg
3940 3945 3950

Asp Phe Ser Ala Glu Tyr Glu Glu Asp Gly Lys Phe Glu Gly Leu Gln
3955 3960 3965

Glu Trp Glu Gly Lys Ala His Leu Asn Ile Lys Ser Pro Ala Phe Thr
3970 3975 3980

Asp Leu His Leu Arg Tyr Gln Lys Asp Lys Lys Gly Ile Ser Thr Ser
3985 3990 3995 4000

Ala Ala Ser Pro Ala Val Gly Thr Val Gly Met Asp Met Asp Glu Asp
4005 4010 4015

Asp Asp Phe Ser Lys Trp Asn Phe Tyr Tyr Ser Pro Gln Ser Ser Pro
4020 4025 4030

Asp Lys Lys Leu Thr Ile Phe Lys Thr Glu Leu Arg Val Arg Glu Ser
4035 4040 4045

Asp Glu Glu Thr Gln Ile Lys Val Asn Trp Glu Glu Glu Ala Ala Ser
4050 4055 4060

Gly Leu Leu Thr Ser Leu Lys Asp Asn Val Pro Lys Ala Thr Gly Val
4065 4070 4075 4080

Leu Tyr Asp Tyr Val Asn Lys Tyr His Trp Glu His Thr Gly Leu Thr
4085 4090 4095

Leu Arg Glu Val Ser Ser Lys Leu Arg Arg Asn Leu Gln Asn Asn Ala
4100 4105 4110

Glu Trp Val Tyr Gln Gly Ala Ile Arg Gln Ile Asp Asp Ile Asp Val
4115 4120 4125

Arg Phe Gln Lys Ala Ala Ser Gly Thr Thr Gly Thr Tyr Gln Glu Trp
4130 4135 4140

Lys Asp Lys Ala Gln Asn Leu Tyr Gln Glu Leu Leu Thr Gln Glu Gly
4145 4150 4155 4160

Gln Ala Ser Phe Gln Gly Leu Lys Asp Asn Val Phe Asp Gly Leu Val
4165 4170 4175

Arg Val Thr Gln Lys Phe His Met Lys Val Lys His Leu Ile Asp Ser
4180 4185 4190

Leu Ile Asp Phe Leu Asn Phe Pro Arg Phe Gln Phe Pro Gly Lys Pro
4195 4200 4205

Gly Ile Tyr Thr Arg Glu Glu Leu Cys Thr Met Phe Ile Arg Glu Val
4210 4215 4220

Gly Thr Val Leu Ser Gln Val Tyr Ser Lys Val His Asn Gly Ser Glu
4225 4230 4235 4240

Ile Leu Phe Ser Tyr Phe Gln Asp Leu Val Ile Thr Leu Pro Phe Glu
4245 4250 4255

Leu Arg Lys His Lys Leu Ile Asp Val Ile Ser Met Tyr Arg Glu Leu
4260 4265 4270

Leu Lys Asp Leu Ser Lys Glu Ala Gln Glu Val Phe Lys Ala Ile Gln
4275 4280 4285

Ser Leu Lys Thr Thr Glu Val Leu Arg Asn Leu Gln Asp Leu Leu Gln
4290 4295 4300

Phe Ile Phe Gln Leu Ile Glu Asp Asn Ile Lys Gln Leu Lys Glu Met
4305 4310 4315 4320

Lys Phe Thr Tyr Leu Ile Asn Tyr Ile Gln Asp Glu Ile Asn Thr Ile
4325 4330 4335

Phe Asn Asp Tyr Ile Pro Tyr Val Phe Lys Leu Leu Lys Glu Asn Leu
4340 4345 4350

Cys Leu Asn Leu His Lys Phe Asn Glu Phe Ile Gln Asn Glu Leu Gln
4355 4360 4365

Glu Ala Ser Gln Glu Leu Gln Gln Ile His Gln Tyr Ile Met Ala Leu
4370 4375 4380

Arg Glu Glu Tyr Phe Asp Pro Ser Ile Val Gly Trp Thr Val Lys Tyr
4385 4390 4395 4400

Tyr Glu Leu Glu Glu Lys Ile Val Ser Leu Ile Lys Asn Leu Leu Val
4405 4410 4415

Ala Leu Lys Asp Phe His Ser Glu Tyr Ile Val Ser Ala Ser Asn Phe
4420 4425 4430

Thr Ser Gln Leu Ser Ser Gln Val Glu Gln Phe Leu His Arg Asn Ile
4435 4440 4445

Gln Glu Tyr Leu Ser Ile Leu Thr Asp Pro Asp Gly Lys Gly Lys Glu
4450 4455 4460

Lys Ile Ala Glu Leu Ser Ala Thr Ala Gln Glu Ile Ile Lys Ser Gln
4465 4470 4475 4480

Ala Ile Ala Thr Lys Lys Ile Ile Ser Asp Tyr His Gln Gln Phe Arg
4485 4490 4495

Tyr Lys Leu Gln Asp Phe Ser Asp Gln Leu Ser Asp Tyr Tyr Glu Lys
4500 4505 4510

Phe Ile Ala Glu Ser Lys Arg Leu Ile Asp Leu Ser Ile Gln Asn Tyr
4515 4520 4525

His Thr Phe Leu Ile Tyr Ile Thr Glu Leu Leu Lys Lys Leu Gln Ser
4530 4535 4540

Thr Thr Val Met Asn Pro Tyr Met Lys Leu Ala Pro Gly Glu Leu Thr
4545 4550 4555 4560

Ile Ile Leu


<210> 33 

<211> 2196 

<212> DNA 

<213> Homo sapiens

<220>
<221> CDS

<222> (13)...(1983)

<223> Nucleotide sequence encoding
5,10-methylenetetrahydrofolate reductase (MTHFR)

<400> 33 

aattccggag cc atg gtg aac gaa gcc aga gga aac agc agc ctc aac ccc 51
Met Val Asn Glu Ala Arg Gly Asn Ser Ser Leu Asn Pro
1 5 10

tgc ttg gag ggc agt gcc agc agt ggc agt gag agc tcc aaa gat agt 99
Cys Leu Glu Gly Ser Ala Ser Ser Gly Ser Glu Ser Ser Lys Asp Ser
15 20 25

tcg aga tgt tcc acc ccg ggc ctg gac cct gag cgg cat gag aga ctc 147
Ser Arg Cys Ser Thr Pro Gly Leu Asp Pro Glu Arg His Glu Arg Leu
30 35 40 45

cgg gag aag atg agg cgg cga ttg gaa tct ggt gac aag tgg ttc tcc 195
Arg Glu Lys Met Arg Arg Arg Leu Glu Ser Gly Asp Lys Trp Phe Ser
50 55 60

ctg gaa ttc ttc cct cct cga act gct gag gga gct gtc aat ctc atc 243
Leu Glu Phe Phe Pro Pro Arg Thr Ala Glu Gly Ala Val Asn Leu Ile
65 70 75

tca agg ttt gac cgg atg gca gca ggt ggc ccc ctc tac ata gac gtg 291
Ser Arg Phe Asp Arg Met Ala Ala Gly Gly Pro Leu Tyr Ile Asp Val
80 85 90

acc tgg cac cca gca ggt gac cct ggc tca gac aag gag acc tcc tcc 339
Thr Trp His Pro Ala Gly Asp Pro Gly Ser Asp Lys Glu Thr Ser Ser
95 100 105

atg atg atc gcc agc acc gcc gtg aac tac tgt ggc ctg gag acc atc 387
Met Met Ile Ala Ser Thr Ala Val Asn Tyr Cys Gly Leu Glu Thr Ile
110 115 120 125

ctg cac atg acc tgc tgc cgt cag cgc ctg gag gag atc acg ggc cat 435
Leu His Met Thr Cys Cys Arg Gln Arg Leu Glu Glu Ile Thr Gly His
130 135 140

ctg cac aaa gct aag cag ctg ggc ctg aag aac atc atg gcg ctg cgg 483
Leu His Lys Ala Lys Gln Leu Gly Leu Lys Asn Ile Met Ala Leu Arg
145 150 155

gga gac cca ata ggt gac cag tgg gaa gag gag gag gga ggc ttc aac 531
Gly Asp Pro Ile Gly Asp Gln Trp Glu Glu Glu Glu Gly Gly Phe Asn
160 165 170

tac gca gtg gac ctg gtg aag cac atc cga agt gag ttt ggt gac tac 579
Tyr Ala Val Asp Leu Val Lys His Ile Arg Ser Glu Phe Gly Asp Tyr
175 180 185

ttt gac atc tgt gtg gca ggt tac ccc aaa ggc cac ccc gaa gca ggg 627
Phe Asp Ile Cys Val Ala Gly Tyr Pro Lys Gly His Pro Glu Ala Gly
190 195 200 205

agc ttt gag gct gac ctg aag cac ttg aag gag aag gtg tct gcg gga 675
Ser Phe Glu Ala Asp Leu Lys His Leu Lys Glu Lys Val Ser Ala Gly
210 215 220

gcc gat ttc atc atc acg cag ctt ttc ttt gag gct gac aca ttc ttc 723
Ala Asp Phe Ile Ile Thr Gln Leu Phe Phe Glu Ala Asp Thr Phe Phe
225 230 235

cgc ttt gtg aag gca tgc acc gac atg ggc atc act tgc ccc atc gtc 771
Arg Phe Val Lys Ala Cys Thr Asp Met Gly Ile Thr Cys Pro Ile Val
240 245 250

ccc ggg atc ttt ccc atc cag ggc tac cac tcc ctt cgg cag ctt gtg 819
Pro Gly Ile Phe Pro Ile Gln Gly Tyr His Ser Leu Arg Gln Leu Val
255 260 265

aag ctg tcc aag ctg gag gtg cca cag gag atc aag gac gtg att gag 867
Lys Leu Ser Lys Leu Glu Val Pro Gln Glu Ile Lys Asp Val Ile Glu
270 275 280 285

cca atc aaa gac aac gat gct gcc atc cgc aac tat ggc atc gag ctg 915
Pro Ile Lys Asp Asn Asp Ala Ala Ile Arg Asn Tyr Gly Ile Glu Leu
290 295 300

gcc gtg agc ctg tgc cag gag ctt ctg gcc agt ggc ttg gtg cca ggc 963
Ala Val Ser Leu Cys Gln Glu Leu Leu Ala Ser Gly Leu Val Pro Gly
305 310 315

ctc cac ttc tac acc ctc aac cgc gag atg gct acc aca gag gtg ctg 1011
Leu His Phe Tyr Thr Leu Asn Arg Glu Met Ala Thr Thr Glu Val Leu
320 325 330

aag cgc ctg ggg atg tgg act gag gac ccc agg cgt ccc cta ccc tgg 1059
Lys Arg Leu Gly Met Trp Thr Glu Asp Pro Arg Arg Pro Leu Pro Trp
335 340 345

gct ctc agt gcc cac ccc aag cgc cga gag gaa gat gta cgt ccc atc 1107
Ala Leu Ser Ala His Pro Lys Arg Arg Glu Glu Asp Val Arg Pro Ile
350 355 360 365

ttc tgg gcc tcc aga cca aag agt tac atc tac cgt acc cag gag tgg 1155
Phe Trp Ala Ser Arg Pro Lys Ser Tyr Ile Tyr Arg Thr Gln Glu Trp
370 375 380

gac gag ttc cct aac ggc cgc tgg ggc aat tcc tct tcc cct gcc ttt 1203
Asp Glu Phe Pro Asn Gly Arg Trp Gly Asn Ser Ser Ser Pro Ala Phe
385 390 395

ggg gag ctg aag gac tac tac ctc ttc tac ctg aag agc aag tcc ccc 1251
Gly Glu Leu Lys Asp Tyr Tyr Leu Phe Tyr Leu Lys Ser Lys Ser Pro
400 405 410

aag gag gag ctg ctg aag atg tgg ggg gag gag ctg acc agt gaa gca 1299
Lys Glu Glu Leu Leu Lys Met Trp Gly Glu Glu Leu Thr Ser Glu Ala
415 420 425

agt gtc ttt gaa gtc ttt gtt ctt tac ctc tcg gga gaa cca aac cgg 1347
Ser Val Phe Glu Val Phe Val Leu Tyr Leu Ser Gly Glu Pro Asn Arg
430 435 440 445

aat ggt cac aaa gtg act tgc ctg ccc tgg aac gat gag ccc ctg gcg 1395
Asn Gly His Lys Val Thr Cys Leu Pro Trp Asn Asp Glu Pro Leu Ala
450 455 460

gct gag acc agc ctg ctg aag gag gag ctg ctg cgg gtg aac cgc cag 1443
Ala Glu Thr Ser Leu Leu Lys Glu Glu Leu Leu Arg Val Asn Arg Gln
465 470 475

ggc atc ctc acc atc aac tca cag ccc aac atc aac ggg aag ccg tcc 1491
Gly Ile Leu Thr Ile Asn Ser Gln Pro Asn Ile Asn Gly Lys Pro Ser
480 485 490

tcc gac ccc atc gtg ggc tgg ggc ccc agc ggg ggc tat gtc ttc cag 1539
Ser Asp Pro Ile Val Gly Trp Gly Pro Ser Gly Gly Tyr Val Phe Gln
495 500 505

aag gcc tac tta gag ttt ttc act tcc cgc gag aca gcg gaa gca ctt 1587
Lys Ala Tyr Leu Glu Phe Phe Thr Ser Arg Glu Thr Ala Glu Ala Leu
510 515 520 525

ctg caa gtg ctg aag aag tac gag ctc cgg gtt aat tac cac ctt gtc 1635
Leu Gln Val Leu Lys Lys Tyr Glu Leu Arg Val Asn Tyr His Leu Val
530 535 540

aat gtg aag ggt gaa aac atc acc aat gcc cct gaa ctg cag ccg aat 1683
Asn Val Lys Gly Glu Asn Ile Thr Asn Ala Pro Glu Leu Gln Pro Asn
545 550 555

gct gtc act tgg ggc atc ttc cct ggg cga gag atc atc cag ccc acc 1731
Ala Val Thr Trp Gly Ile Phe Pro Gly Arg Glu Ile Ile Gln Pro Thr
560 565 570

gta gtg gat ccc gtc agc ttc atg ttc tgg aag gac gag gcc ttt gcc 1779
Val Val Asp Pro Val Ser Phe Met Phe Trp Lys Asp Glu Ala Phe Ala
575 580 585

ctg tgg att gag cgg tgg gga aag ctg tat gag gag gag tcc ccg tcc 1827
Leu Trp Ile Glu Arg Trp Gly Lys Leu Tyr Glu Glu Glu Ser Pro Ser
590 595 600 605

cgc acc atc atc cag tac atc cac gac aac tac ttc ctg gtc aac ctg 1875
Arg Thr Ile Ile Gln Tyr Ile His Asp Asn Tyr Phe Leu Val Asn Leu
610 615 620

gtg gac aat gac ttc cca ctg gac aac tgc ctc tgg cag gtg gtg gaa 1923
Val Asp Asn Asp Phe Pro Leu Asp Asn Cys Leu Trp Gln Val Val Glu
625 630 635

gac aca ttg gag ctt ctc aac agg ccc acc cag aat gcg aga gaa acg 1971
Asp Thr Leu Glu Leu Leu Asn Arg Pro Thr Gln Asn Ala Arg Glu Thr
640 645 650

gag gct cca tga ccctgcgtcc tgacgccctg cgttggagcc actcctgtcc 2023
Glu Ala Pro *
655

cgccttcctc ctccacagtg ctgcttctct tgggaactcc actctccttc gtgtctctcc 2083

caccccggcc tccactcccc cacctgacaa tggcagctag actggagtga ggcttccagg 2143

ctcttcctgg acctgagtcg gccccacatg ggaacctagt actctctgct cta 2196

<210> 34 

<211> 656 

<212> PRT 

<213> Homo sapiens

<400> 34 

Met Val Asn Glu Ala Arg Gly Asn Ser Ser Leu Asn Pro Cys Leu Glu
1 5 10 15

Gly Ser Ala Ser Ser Gly Ser Glu Ser Ser Lys Asp Ser Ser Arg Cys
20 25 30

Ser Thr Pro Gly Leu Asp Pro Glu Arg His Glu Arg Leu Arg Glu Lys
35 40 45

Met Arg Arg Arg Leu Glu Ser Gly Asp Lys Trp Phe Ser Leu Glu Phe
50 55 60

Phe Pro Pro Arg Thr Ala Glu Gly Ala Val Asn Leu Ile Ser Arg Phe
65 70 75 80

Asp Arg Met Ala Ala Gly Gly Pro Leu Tyr Ile Asp Val Thr Trp His
85 90 95

Pro Ala Gly Asp Pro Gly Ser Asp Lys Glu Thr Ser Ser Met Met Ile
100 105 110

Ala Ser Thr Ala Val Asn Tyr Cys Gly Leu Glu Thr Ile Leu His Met
115 120 125

Thr Cys Cys Arg Gln Arg Leu Glu Glu Ile Thr Gly His Leu His Lys
130 135 140

Ala Lys Gln Leu Gly Leu Lys Asn Ile Met Ala Leu Arg Gly Asp Pro
145 150 155 160

Ile Gly Asp Gln Trp Glu Glu Glu Glu Gly Gly Phe Asn Tyr Ala Val
165 170 175

Asp Leu Val Lys His Ile Arg Ser Glu Phe Gly Asp Tyr Phe Asp Ile
180 185 190

Cys Val Ala Gly Tyr Pro Lys Gly His Pro Glu Ala Gly Ser Phe Glu
195 200 205

Ala Asp Leu Lys His Leu Lys Glu Lys Val Ser Ala Gly Ala Asp Phe
210 215 220

Ile Ile Thr Gln Leu Phe Phe Glu Ala Asp Thr Phe Phe Arg Phe Val
225 230 235 240

Lys Ala Cys Thr Asp Met Gly Ile Thr Cys Pro Ile Val Pro Gly Ile
245 250 255

Phe Pro Ile Gln Gly Tyr His Ser Leu Arg Gln Leu Val Lys Leu Ser
260 265 270

Lys Leu Glu Val Pro Gln Glu Ile Lys Asp Val Ile Glu Pro Ile Lys
275 280 285

Asp Asn Asp Ala Ala Ile Arg Asn Tyr Gly Ile Glu Leu Ala Val Ser
290 295 300

Leu Cys Gln Glu Leu Leu Ala Ser Gly Leu Val Pro Gly Leu His Phe
305 310 315 320

Tyr Thr Leu Asn Arg Glu Met Ala Thr Thr Glu Val Leu Lys Arg Leu
325 330 335

Gly Met Trp Thr Glu Asp Pro Arg Arg Pro Leu Pro Trp Ala Leu Ser
340 345 350

Ala His Pro Lys Arg Arg Glu Glu Asp Val Arg Pro Ile Phe Trp Ala
355 360 365

Ser Arg Pro Lys Ser Tyr Ile Tyr Arg Thr Gln Glu Trp Asp Glu Phe
370 375 380

Pro Asn Gly Arg Trp Gly Asn Ser Ser Ser Pro Ala Phe Gly Glu Leu
385 390 395 400

Lys Asp Tyr Tyr Leu Phe Tyr Leu Lys Ser Lys Ser Pro Lys Glu Glu
405 410 415

Leu Leu Lys Met Trp Gly Glu Glu Leu Thr Ser Glu Ala Ser Val Phe
420 425 430

Glu Val Phe Val Leu Tyr Leu Ser Gly Glu Pro Asn Arg Asn Gly His
435 440 445

Lys Val Thr Cys Leu Pro Trp Asn Asp Glu Pro Leu Ala Ala Glu Thr
450 455 460

Ser Leu Leu Lys Glu Glu Leu Leu Arg Val Asn Arg Gln Gly Ile Leu
465 470 475 480

Thr Ile Asn Ser Gln Pro Asn Ile Asn Gly Lys Pro Ser Ser Asp Pro
485 490 495

Ile Val Gly Trp Gly Pro Ser Gly Gly Tyr Val Phe Gln Lys Ala Tyr
500 505 510

Leu Glu Phe Phe Thr Ser Arg Glu Thr Ala Glu Ala Leu Leu Gln Val
515 520 525

Leu Lys Lys Tyr Glu Leu Arg Val Asn Tyr His Leu Val Asn Val Lys
530 535 540

Gly Glu Asn Ile Thr Asn Ala Pro Glu Leu Gln Pro Asn Ala Val Thr
545 550 555 560

Trp Gly Ile Phe Pro Gly Arg Glu Ile Ile Gln Pro Thr Val Val Asp
565 570 575

Pro Val Ser Phe Met Phe Trp Lys Asp Glu Ala Phe Ala Leu Trp Ile
580 585 590

Glu Arg Trp Gly Lys Leu Tyr Glu Glu Glu Ser Pro Ser Arg Thr Ile
595 600 605

Ile Gln Tyr Ile His Asp Asn Tyr Phe Leu Val Asn Leu Val Asp Asn
610 615 620

Asp Phe Pro Leu Asp Asn Cys Leu Trp Gln Val Val Glu Asp Thr Leu
625 630 635 640

Glu Leu Leu Asn Arg Pro Thr Gln Asn Ala Arg Glu Thr Glu Ala Pro
645 650 655

<210> 35 

<211> 3834 

<212> DNA 

<213> Homo sapiens

<220>
<221> CDS

<222> (117)...(1949)

<223> Nucleotide sequence encoding selectin E (SELE) 
<400> 35 

cctgagacag aggcagcagt gatacccacc tgagagatcc tgtgtttgaa caactgcttc 60 
ccaaaacgga aagtatttca agcctaaacc tttgggtgaa aagaactctt gaagtc atg 119 

Met 
1 

att gct tca cag ttt ctc tca gct ctc act ttg gtg ctt ctc att aaa 167
Ile Ala Ser Gln Phe Leu Ser Ala Leu Thr Leu Val Leu Leu Ile Lys
5 10 15

gag agt gga gcc tgg tct tac aac acc tcc acg gaa gct atg act tat 215
Glu Ser Gly Ala Trp Ser Tyr Asn Thr Ser Thr Glu Ala Met Thr Tyr
20 25 30

gat gag gcc agt gct tat tgt cag caa agg tac aca cac ctg gtt gca 263
Asp Glu Ala Ser Ala Tyr Cys Gln Gln Arg Tyr Thr His Leu Val Ala
35 40 45

att caa aac aaa gaa gag att gag tac cta aac tcc ata ttg agc tat 311
Ile Gln Asn Lys Glu Glu Ile Glu Tyr Leu Asn Ser Ile Leu Ser Tyr
50 55 60 65

tca cca agt tat tac tgg att gga atc aga aaa gtc aac aat gtg tgg 359
Ser Pro Ser Tyr Tyr Trp Ile Gly Ile Arg Lys Val Asn Asn Val Trp
70 75 80

gtc tgg gta gga acc cag aaa cct ctg aca gaa gaa gcc aag aac tgg 407
Val Trp Val Gly Thr Gln Lys Pro Leu Thr Glu Glu Ala Lys Asn Trp
85 90 95

gct cca ggt gaa ccc aac aat agg caa aaa gat gag gac tgc gtg gag 455
Ala Pro Gly Glu Pro Asn Asn Arg Gln Lys Asp Glu Asp Cys Val Glu
100 105 110

atc tac atc aag aga gaa aaa gat gtg ggc atg tgg aat gat gag agg 503
Ile Tyr Ile Lys Arg Glu Lys Asp Val Gly Met Trp Asn Asp Glu Arg
115 120 125

tgc agc aag aag aag ctt gcc cta tgc tac aca gct gcc tgt acc aat 551
Cys Ser Lys Lys Lys Leu Ala Leu Cys Tyr Thr Ala Ala Cys Thr Asn
130 135 140 145

aca tcc tgc agt ggc cac ggt gaa tgt gta gag acc atc aat aat tac 599
Thr Ser Cys Ser Gly His Gly Glu Cys Val Glu Thr Ile Asn Asn Tyr
150 155 160

act tgc aag tgt gac cct ggc ttc agt gga ctc aag tgt gag caa att 647
Thr Cys Lys Cys Asp Pro Gly Phe Ser Gly Leu Lys Cys Glu Gln Ile
165 170 175

gtg aac tgt aca gcc ctg gaa tcc cct gag cat gga agc ctg gtt tgc 695
Val Asn Cys Thr Ala Leu Glu Ser Pro Glu His Gly Ser Leu Val Cys
180 185 190

agt cac cca ctg gga aac ttc agc tac aat tct tcc tgc tct atc agc 743
Ser His Pro Leu Gly Asn Phe Ser Tyr Asn Ser Ser Cys Ser Ile Ser
195 200 205

tgt gat agg ggt tac ctg cca agc agc atg gag acc atg cag tgt atg 791
Cys Asp Arg Gly Tyr Leu Pro Ser Ser Met Glu Thr Met Gln Cys Met
210 215 220 225

tcc tct gga gaa tgg agt gct cct att cca gcc tgc aat gtg gtt gag 839
Ser Ser Gly Glu Trp Ser Ala Pro Ile Pro Ala Cys Asn Val Val Glu
230 235 240

tgt gat gct gtg aca aat cca gcc aat ggg ttc gtg gaa tgt ttc caa 887
Cys Asp Ala Val Thr Asn Pro Ala Asn Gly Phe Val Glu Cys Phe Gln
245 250 255

aac cct gga agc ttc cca tgg aac aca acc tgt aca ttt gac tgt gaa 935
Asn Pro Gly Ser Phe Pro Trp Asn Thr Thr Cys Thr Phe Asp Cys Glu
260 265 270

gaa gga ttt gaa cta atg gga gcc cag agc ctt cag tgt acc tca tct 983
Glu Gly Phe Glu Leu Met Gly Ala Gln Ser Leu Gln Cys Thr Ser Ser
275 280 285

ggg aat tgg gac aac gag aag cca acg tgt aaa gct gtg aca tgc agg 1031
Gly Asn Trp Asp Asn Glu Lys Pro Thr Cys Lys Ala Val Thr Cys Arg
290 295 300 305

gcc gtc cgc cag cct cag aat ggc tct gtg agg tgc agc cat tcc cct 1079
Ala Val Arg Gln Pro Gln Asn Gly Ser Val Arg Cys Ser His Ser Pro
310 315 320

gct gga gag ttc acc ttc aaa tca tcc tgc aac ttc acc tgt gag gaa 1127
Ala Gly Glu Phe Thr Phe Lys Ser Ser Cys Asn Phe Thr Cys Glu Glu
325 330 335

ggc ttc atg ttg cag gga cca gcc cag gtt gaa tgc acc act caa ggg 1175
Gly Phe Met Leu Gln Gly Pro Ala Gln Val Glu Cys Thr Thr Gln Gly
340 345 350

cag tgg aca cag caa atc cca gtt tgt gaa gct ttc cag tgc aca gcc 1223
Gln Trp Thr Gln Gln Ile Pro Val Cys Glu Ala Phe Gln Cys Thr Ala
355 360 365

ttg tcc aac ccc gag cga ggc tac atg aat tgt ctt cct agt gct tct 1271
Leu Ser Asn Pro Glu Arg Gly Tyr Met Asn Cys Leu Pro Ser Ala Ser
370 375 380 385

ggc agt ttc cgt tat ggg tcc agc tgt gag ttc tcc tgt gag cag ggt 1319
Gly Ser Phe Arg Tyr Gly Ser Ser Cys Glu Phe Ser Cys Glu Gln Gly
390 395 400

ttt gtg ttg aag gga tcc aaa agg ctc caa tgt ggc ccc aca ggg gag 1367
Phe Val Leu Lys Gly Ser Lys Arg Leu Gln Cys Gly Pro Thr Gly Glu
405 410 415

tgg gac aac gag aag ccc aca tgt gaa gct gtg aga tgc gat gct gtc 1415
Trp Asp Asn Glu Lys Pro Thr Cys Glu Ala Val Arg Cys Asp Ala Val
420 425 430

cac cag ccc ccg aag ggt ttg gtg agg tgt gct cat tcc cct att gga 1463
His Gln Pro Pro Lys Gly Leu Val Arg Cys Ala His Ser Pro Ile Gly
435 440 445

gaa ttc acc tac aag tcc tct tgt gcc ttc agc tgt gag gag gga ttt 1511
Glu Phe Thr Tyr Lys Ser Ser Cys Ala Phe Ser Cys Glu Glu Gly Phe
450 455 460 465

gaa tta tat gga tca act caa ctt gag tgc aca tct cag gga caa tgg 1559
Glu Leu Tyr Gly Ser Thr Gln Leu Glu Cys Thr Ser Gln Gly Gln Trp
470 475 480

aca gaa gag gtt cct tcc tgc caa gtg gta aaa tgt tca agc ctg gca 1607
Thr Glu Glu Val Pro Ser Cys Gln Val Val Lys Cys Ser Ser Leu Ala
485 490 495

gtt ccg gga aag atc aac atg agc tgc agt ggg gag ccc gtg ttt ggc 1655
Val Pro Gly Lys Ile Asn Met Ser Cys Ser Gly Glu Pro Val Phe Gly
500 505 510

act gtg tgc aag ttc gcc tgt cct gaa gga tgg acg ctc aat ggc tct 1703
Thr Val Cys Lys Phe Ala Cys Pro Glu Gly Trp Thr Leu Asn Gly Ser
515 520 525

gca gct cgg aca tgt gga gcc aca gga cac tgg tct ggc ctg cta cct 1751
Ala Ala Arg Thr Cys Gly Ala Thr Gly His Trp Ser Gly Leu Leu Pro
530 535 540 545

acc tgt gaa gct ccc act gag tcc aac att ccc ttg gta gct gga ctt 1799
Thr Cys Glu Ala Pro Thr Glu Ser Asn Ile Pro Leu Val Ala Gly Leu
550 555 560

tct gct gct gga ctc tcc ctc ctg aca tta gca cca ttt ctc ctc tgg 1847
Ser Ala Ala Gly Leu Ser Leu Leu Thr Leu Ala Pro Phe Leu Leu Trp
565 570 575

ctt cgg aaa tgc tta cgg aaa gca aag aaa ttt gtt cct gcc agc agc 1895
Leu Arg Lys Cys Leu Arg Lys Ala Lys Lys Phe Val Pro Ala Ser Ser
580 585 590

tgc caa agc ctt gaa tca gac gga agc tac caa aag cct tct tac atc 1943
Cys Gln Ser Leu Glu Ser Asp Gly Ser Tyr Gln Lys Pro Ser Tyr Ile
595 600 605

ctt taa gttcaaaaga atcagaaaca ggtgcatctg gggaactaga gggatacact 1999
Leu *
610

gaagttaaca gagacagata actctcctcg ggtctctggc ccttcttgcc tactatgcca 2059
gatgccttta tggctgaaac cgcaacaccc atcaccactt caatagatca aagtccagca 2119
ggcaaggacg gccttcaact gaaaagactc agtgttccct ttcctactct caggatcaag 2179
aaagtgttgg ctaatgaagg gaaaggatat tttcttccaa gcaaaggtga agagaccaag 2239
actctgaaat ctcagaattc cttttctaac tctcccttgc tcgctgtaaa atcttggcac 2299
agaaacacaa tattttgtgg ctttctttct tttgcccttc acagtgtttc gacagctgat 2359
tacacagttg ctgtcataag aatgaataat aattatccag agtttagagg aaaaaaatga 2419
ctaaaaatat tataacttaa aaaaatgaca gatgttgaat gcccacaggc aaatgcatgg 2479
agggttgtta atggtgcaaa tcctactgaa tgctctgtgc gagggttact atgcacaatt 2539
taatcacttt catccctatg ggattcagtg cttcttaaag agttcttaag gattgtgata 2599
tttttacttg cattgaatat attataatct tccatacttc ttcattcaat acaagtgtgg 2659
tagggactta aaaaacttgt aaatgctgtc aactatgata tggtaaaagt tacttattct 2719
agattacccc ctcattgttt attaacaaat tatgttacat ctgttttaaa tttatttcaa 2779
aaagggaaac tattgtcccc tagcaaggca tgatgttaac cagaataaag ttctgagtgt 2839
ttttactaca gttgtttttt gaaaacatgg tagaattgga gagtaaaaac tgaatggaag 2899
gtttgtatat tgtcagatat tttttcagaa atatgtggtt tccacgatga aaaacttcca 2959
tgaggccaaa cgttttgaac taataaaagc ataaatgcaa acacacaaag gtataatttt 3019
atgaatgtct ttgttggaaa agaatacaga aagatggatg tgctttgcat tcctacaaag 3079
atgtttgtca gatgtgatat gtaaacataa ttcttgtata ttatggaaga ttttaaattc 3139
acaatagaaa ctcaccatgt aaaagagtca tctggtagat ttttaacgaa tgaagatgtc 3199
taatagttat tccctatttg ttttcttctg tatgttaggg tgctctggaa gagaggaatg 3259
cctgtgtgag caagcattta tgtttattta taagcagatt taacaattcc aaaggaatct 3319
ccagttttca gttgatcact ggcaatgaaa aattctcagt cagtaattgc caaagctgct 3379
ctagccttga ggagtgtgag aatcaaaact ctcctacact tccattaact tagcatgtgt 3439
tgaaaaaaaa agtttcagag aagttctggc tgaacactgg caacgacaaa gccaacagtc 3499
aaaacagaga tgtgataagg atcagaacag cagaggttct tttaaagggg cagaaaaact 3559
ctgggaaata agagagaaca actactgtga tcaggctatg tatggaatac agtgttattt 3619
tctttgaaat tgtttaagtg ttgtaaatat ttatgtaaac tgcattagaa attagctgtg 3679
tgaaatacca gtgtggtttg tgtttgagtt ttattgagaa ttttaaatta taacttaaaa 3739
tattttataa tttttaaagt atatatttat ttaagcttat gtcagaccta tttgacataa 3799
cactataaag gttgacaata aatgtgctta tgttt 3834


<210> 36 

<211> 610 

<212> PRT 

<213> Homo sapiens


<400> 36 


Met 

He 

Ala 

Ser 

Gin 

Phe 

Leu 

Ser 

Ala 

Leu 

Thr 

Leu 

Val 

Leu 

Leu 

He 

1 




5 





10 





15 


Lys 

Glu 

Ser 

Gly 

Ala 

Trp 

Ser 

Tyr 

Asn 

Thr 

Ser 

Thr 

Glu 

Ala 

Met 

Thr 



20 





25 





30 



Tyr 

Asp 

Glu 

Ala 

Ser 

Ala 

Tyr 

Cys 

Gin 

Gin 

Arg 

Tyr 

Thr 

His 

Leu 

Val 



35 





40 





45 




Ala 

He 

Gin 

Asn 

Lys 

Glu 

Glu 

He 

Glu 

Tyr 

Leu 

Asn 

Ser 

He 

Leu 

Ser 


50 





55 





60 





Tyr 

Ser 

Pro 

Ser 

Tyr 

Tyr 

Trp 

He 

Gly 

He 

Arg 

Lys 

Val 

Asn 

Asn 

Val 

65 





70 





75 





80 

Trp 

Val 

Trp 

Val 

Gly 

Thr 

Gin 

Lys 

Pro 

Leu 

Thr 

Glu 

Glu 

Ala 

Lys 

Asn 





85 





90 





95 


Trp 

Ala 

Pro 

Gly 

Glu 

Pro 

Asn 

Asn 

Arg 

Gin 

Lys 

Asp 

Glu 

Asp 

Cys 

Val 


100 


105 


110 


wo 02/072604 


Glu 

He 

Tyr 

He 

Lys 

Arg 

Glu 

Lys 

Asp 



115 





120 


Arg 

Cys 

Ser 

Lys 

Lys 

Lys 

Leu 

Ala 

Leu 


130 





135 



Asn 

Thr 

Ser 

Cys 

Ser 

Gly 

His 

Gly 

Glu 

145 





150 




Tyr 

Thr 

Cys 

Lys 

Cys 

Asp 

Pro 

Gly 

Phe 





165 





lie 

Val 

Asn 

Cys 

Thr 

Ala 

Leu 

Glu 

Ser 




180 





185 

Cys 

Ser 

His 

Pro 

Leu 

Gly 

Asn 

Phe 

Ser 



195 





200 


Ser 

Cys 

Asp 

Arg 

Gly 

Tyr 

Leu 

Pro 

Ser 


210 





215 



Met 

Ser 

Ser 

Gly 

Glu 

Trp 

Ser 

Ala 

Pro 

225 





230 




Glu 

Cys 

Asp 

Ala 

Val 

Thr 

Asn 

Pro 

Ala 





245 





Gin 

Asn 

Pro 

Gly 

Ser 

Phe 

Pro 

Trp 

Asn 




260 





265 

Glu 

Glu 

Gly 

Phe 

Glu 

Leu 

Met 

Gly Ala 



275 





280 


Ser 

Gly 

Asn 

Trp 

Asp 

Asn 

Glu 

Lys 

Pro 


290 





295 



Arg 

Ala 

Val 

Arg 

Gin 

Pro 

Gin 

Asn 

Gly 

305 





310 




Pro 

Ala 

Gly 

Glu 

Phe 

Thr 

Phe 

Lys 

Ser 





325 





Glu 

Gly 

Phe 

Met 

Leu 

Gin 

Gly 

Pro 

Ala 




340 





345 

Gly 

Gin 

Trp 

Thr 

Gin 

Gin 

He 

Pro 

Val 



355 





360 


Ala 

Leu 

Ser 

Asn 

Pro 

Glu 

Arg 

Gly 

Tyr 


370 





375 



Ser 

Gly 

Ser 

Phe 

Arg 

Tyr 

Gly 

Ser 

Ser 

385 





390 




Gly 

Phe 

Val 

Leu 

Lys 

Gly 

Ser 

Lys 

Arg 





405 





Glu 

Trp 

Asp 

Asn 

Glu 

Lys 

Pro 

Thr 

Cys 




420 





425 

Val 

His 

Gin 

Pro 

Pro 

Lys 

Gly 

Leu 

Val 



435 





440 


Gly 

Glu 

Phe 

Thr 

Tyr 

Lys 

Ser 

Ser 

Cys 


450 





455 



Phe 

Glu 

Leu 

Tyr 

Gly 

Ser 

Thr 

Gin 

Leu 

465 





470 




Trp 

Thr 

Glu 

Glu 

Val 

Pro 

Ser 

Cys 

Gin 





485 





Ala 

Val 

Pro 

Gly 

Lys 

He 

Asn 

Met 

Ser 




500 





505 

Gly 

Thr 

Val 

Cys 

Lys 

Phe 

Ala 

Cys 

Pro 



515 





520 


Ser 

Ala 

Ala 

Arg 

Thr 

Cys 

Gly 

Ala 

Thr 


530 





535 



Pro 

Thr 

Cys 

Glu 

Ala 

Pro 

Thr 

Glu 

Ser 

545 





550 




Leu 

Ser 

Ala 

Ala 

Gly 

Leu 

Ser 

Leu 

Leu 
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Val 

Gly 

Met 

Trp 

Asn 

Asp 

Glu 




125 




Cys 

Tyr 

Thr 

Ala 

Ala 

Cys 

Thr 



140 





Cys 

Val 

Glu 

Thr 

He 

Asn 

Asn 


155 





160 

Ser 

Gly 

Leu 

Lys 

Cys 

Glu 

Gin 

170 





175 


Pro 

Glu 

His 

Gly 

Ser 

Leu 

Val 





190 



Tyr 

Asn 

Ser 

Ser 

Cys 

Ser 

He 




205 




Ser 

Met 

Glu 

Thr 

Met 

Gin 

Cys 



220 





He 

Pro 

Ala 

Cys 

Asn 

Val 

Val 


235 





240 

Asn 

Gly 

Phe 

Val 

Glu 

Cys 

Phe 

250 





255 


Thr 

Thr 

Cys 

Thr 

Phe 

Asp 

Cys 





270 



Gin 

Ser 

Leu 

Gin 

Cys 

Thr 

Ser 




285 




Thr 

Cys 

Lys 

Ala 

Val 

Thr 

Cys 



300 





Ser 

Val 

Arg 

Cys 

Ser 

His 

Ser 


315 





320 

Ser 

Cys 

Asn 

Phe 

Thr 

Cys 

Glu 

330 





335 


Gin 

Val 

Glu 

Cys 

Thr 

Thr 

Gin 





350 



Cys 

Glu 

Ala 

Phe 

Gin 

Cys 

Thr 




365 




Met 

Asn 

Cys 

Leu 

Pro 

Ser 

Ala 



380 





Cys 

Glu 

Phe 

Ser 

Cys 

Glu 

Gin 


395 





400 

Leu 

Gin 

Cys 

Gly 

Pro 

Thr 

Gly 

410 





415 


Glu 

Ala 

Val 

Arg 

Cys 

Asp 

Ala 





430 



Arg 

Cys 

Ala 

His 

Ser 

Pro 

He 




445 




Ala 

Phe 

Ser 

Cys 

Glu 

Glu 

Gly 



460 





Glu 

Cys 

Thr 

Ser 

Gin 

Gly 

Gin 


475 





480 

Val 

Val 

Lys 

Cys 

Ser 

Ser 

Leu 

490 





495 


Cys 

Ser 

Gly 

Glu 

Pro 

Val 

Phe 





510 



Glu 

Gly 

Trp 

Thr 

Leu 

Asn 

Gly 




525 




Gly 

His 

Trp 

Ser 

Gly 

Leu 

Leu 



540 





Asn 

He 

Pro 

Leu 

Val 

Ala 

Gly 


555 





560 

Thr 

Leu 

Ala 

Pro 

Phe 

Leu 

Leu 

570 





575 
Trp Leu Arg Lys Cys Leu Arg Lys Ala Lys Lys Phe Val Pro Ala Ser
580 585 590

Ser Cys Gln Ser Leu Glu Ser Asp Gly Ser Tyr Gln Lys Pro Ser Tyr
595 600 605

Ile Leu
610 

<210> 37 

<211> 1922 

<212> DNA 

<213> Homo sapiens

<220>
<221> CDS

<222> (406)...(1428)

<223> Nucleotide sequence encoding nucleotide binding 
protein (G Protein), beta polypeptide 3 (GNB3) 

<400> 37 

ccacaatagg ggcagacctg tccatccttc tctgtgggtc ccctgtacct ttctccccca 60

acaggatcag acccagaggc agctggttgg ggtttgtcga gaagaaggat tatccagatc 120

agtcctttct aatctcagct cctgcctgta ccctcccata ctcaccaaac cctcttcccc 180

accaccctga gctgaggagc acagtttgag gcccccccaa ccccccgccg gtcggggcca 240

ggccaggcca ggccagctcc tctggcagca gagcctgggc aggtgacggg cgggcgcggg 300

cgtcgcagct gagggagtaa ggaggctccc aggaaccgga gctggaaacc cggccgaggt 360

ccagccagag cccaagagcc agagtgaccc ctcgacctgt cagcc atg ggg gag atg 417
Met Gly Glu Met
1

gag caa ctg cgt cag gaa gcg gag cag ctc aag aag cag att gca gat 465
Glu Gln Leu Arg Gln Glu Ala Glu Gln Leu Lys Lys Gln Ile Ala Asp
5 10 15 20

gcc agg aaa gcc tgt gct gac gtt act ctg gca gag ctg gtg tct ggc 513
Ala Arg Lys Ala Cys Ala Asp Val Thr Leu Ala Glu Leu Val Ser Gly
25 30 35

cta gag gtg gtg gga cga gtc cag atg cgg acg cgg cgg acg tta agg 561
Leu Glu Val Val Gly Arg Val Gln Met Arg Thr Arg Arg Thr Leu Arg
40 45 50

gga cac ctg gcc aag att tac gcc atg cac tgg gcc act gat tct aag 609
Gly His Leu Ala Lys Ile Tyr Ala Met His Trp Ala Thr Asp Ser Lys
55 60 65

ctg ctg gta agt gcc tcg caa gat ggg aag ctg atc gtg tgg gac agc 657
Leu Leu Val Ser Ala Ser Gln Asp Gly Lys Leu Ile Val Trp Asp Ser
70 75 80

tac acc acc aac aag gtg cac gcc atc cca ctg cgc tcc tcc tgg gtc 705
Tyr Thr Thr Asn Lys Val His Ala Ile Pro Leu Arg Ser Ser Trp Val
85 90 95 100

atg acc tgt gcc tat gcc cca tca ggg aac ttt gtg gca tgt ggg ggg 753
Met Thr Cys Ala Tyr Ala Pro Ser Gly Asn Phe Val Ala Cys Gly Gly
105 110 115

ctg gac aac atg tgt tcc atc tac aac ctc aaa tcc cgt gag ggc aat 801
Leu Asp Asn Met Cys Ser Ile Tyr Asn Leu Lys Ser Arg Glu Gly Asn
120 125 130

gtc aag gtc agc cgg gag ctt tct gct cac aca ggt tat ctc tcc tgc 849
Val Lys Val Ser Arg Glu Leu Ser Ala His Thr Gly Tyr Leu Ser Cys
135 140 145

tgc cgc ttc ctg gat gac aac aat att gtg acc agc tcg ggg gac acc 897
Cys Arg Phe Leu Asp Asp Asn Asn Ile Val Thr Ser Ser Gly Asp Thr
150 155 160

acg tgt gcc ttg tgg gac att gag act ggg cag cag aag act gta ttt 945
Thr Cys Ala Leu Trp Asp Ile Glu Thr Gly Gln Gln Lys Thr Val Phe
165 170 175 180

gtg gga cac acg ggt gac tgc atg agc ctg gct gtg tct cct gac ttc 993
Val Gly His Thr Gly Asp Cys Met Ser Leu Ala Val Ser Pro Asp Phe
185 190 195

aat ctc ttc att tcg ggg gcc tgt gat gcc agt gcc aag ctc tgg gat 1041
Asn Leu Phe Ile Ser Gly Ala Cys Asp Ala Ser Ala Lys Leu Trp Asp
200 205 210

gtg cga gag ggg acc tgc cgt cag act ttc act ggc cac gag tcg gac 1089
Val Arg Glu Gly Thr Cys Arg Gln Thr Phe Thr Gly His Glu Ser Asp
215 220 225

atc aac gcc atc tgt ttc ttc ccc aat gga gag gcc atc tgc acg ggc 1137
Ile Asn Ala Ile Cys Phe Phe Pro Asn Gly Glu Ala Ile Cys Thr Gly
230 235 240

tcg gat gac gct tcc tgc cgc ttg ttt gac ctg cgg gca gac cag gag 1185
Ser Asp Asp Ala Ser Cys Arg Leu Phe Asp Leu Arg Ala Asp Gln Glu
245 250 255 260

ctg atc tgc ttc tcc cac gag agc atc atc tgc ggc atc acg tcc gtg 1233
Leu Ile Cys Phe Ser His Glu Ser Ile Ile Cys Gly Ile Thr Ser Val
265 270 275

gcc ttc tcc ctc agt ggc cgc cta cta ttc gct ggc tac gac gac ttc 1281
Ala Phe Ser Leu Ser Gly Arg Leu Leu Phe Ala Gly Tyr Asp Asp Phe
280 285 290

aac tgc aat gtc tgg gac tcc atg aag tct gag cgt gtg ggc atc ctc 1329
Asn Cys Asn Val Trp Asp Ser Met Lys Ser Glu Arg Val Gly Ile Leu
295 300 305

tct ggc cac gat aac agg gtg agc tgc ctg gga gtc aca gct gac ggg 1377
Ser Gly His Asp Asn Arg Val Ser Cys Leu Gly Val Thr Ala Asp Gly
310 315 320

atg gct gtg gcc aca ggt tcc tgg gac agc ttc ctc aaa atc tgg aac 1425
Met Ala Val Ala Thr Gly Ser Trp Asp Ser Phe Leu Lys Ile Trp Asn
325 330 335 340

tga ggaggctgga gaaagggaag tggaaggcag tgaacacact cagcagcccc 1478
ctgcccgacc ccatctcatt caggtgttct cttctatatt ccgggtgcca ttcccactaa 1538

gctttctcct ttgagggcag tggggagcat gggactgtgc ctttgggagg cagcatcagg 1598

gacacagggg caaagaactg ccccatctcc tcccatggcc ttccctcccc acagtcctca 1658

cagcctctcc cttaatgagc aaggacaacc tgcccctccc cagccctttg caggcccagc 1718

agacttgagt ctgaggcccc aggccctagg attcctcccc cagagccact acctttgtcc 1778

aggcctgggt ggtatagggc gtttggccct gtgactatgg ctctggcacc actagggtcc 1838

tggccctctt cttattcatg ctttctcctt tttctacctt tttttctctc ctaagacacc 1898

tgcaataaag tgtagcaccc tggt 1922

<210> 38

<211> 340

<212> PRT

<213> Homo sapiens

<400> 38

Met Gly Glu Met Glu Gln Leu Arg Gln Glu Ala Glu Gln Leu Lys Lys
1 5 10 15

Gln Ile Ala Asp Ala Arg Lys Ala Cys Ala Asp Val Thr Leu Ala Glu
20 25 30

Leu Val Ser Gly Leu Glu Val Val Gly Arg Val Gln Met Arg Thr Arg
35 40 45

Arg Thr Leu Arg Gly His Leu Ala Lys Ile Tyr Ala Met His Trp Ala
50 55 60

Thr Asp Ser Lys Leu Leu Val Ser Ala Ser Gln Asp Gly Lys Leu Ile
65 70 75 80

Val Trp Asp Ser Tyr Thr Thr Asn Lys Val His Ala Ile Pro Leu Arg
85 90 95

Ser Ser Trp Val Met Thr Cys Ala Tyr Ala Pro Ser Gly Asn Phe Val
100 105 110

Ala Cys Gly Gly Leu Asp Asn Met Cys Ser Ile Tyr Asn Leu Lys Ser
115 120 125

Arg Glu Gly Asn Val Lys Val Ser Arg Glu Leu Ser Ala His Thr Gly
130 135 140

Tyr Leu Ser Cys Cys Arg Phe Leu Asp Asp Asn Asn Ile Val Thr Ser
145 150 155 160

Ser Gly Asp Thr Thr Cys Ala Leu Trp Asp Ile Glu Thr Gly Gln Gln
165 170 175

Lys Thr Val Phe Val Gly His Thr Gly Asp Cys Met Ser Leu Ala Val
180 185 190

Ser Pro Asp Phe Asn Leu Phe Ile Ser Gly Ala Cys Asp Ala Ser Ala
195 200 205

Lys Leu Trp Asp Val Arg Glu Gly Thr Cys Arg Gln Thr Phe Thr Gly
210 215 220

His Glu Ser Asp Ile Asn Ala Ile Cys Phe Phe Pro Asn Gly Glu Ala
225 230 235 240

Ile Cys Thr Gly Ser Asp Asp Ala Ser Cys Arg Leu Phe Asp Leu Arg
245 250 255

Ala Asp Gln Glu Leu Ile Cys Phe Ser His Glu Ser Ile Ile Cys Gly
260 265 270

Ile Thr Ser Val Ala Phe Ser Leu Ser Gly Arg Leu Leu Phe Ala Gly
275 280 285

Tyr Asp Asp Phe Asn Cys Asn Val Trp Asp Ser Met Lys Ser Glu Arg
290 295 300

Val Gly Ile Leu Ser Gly His Asp Asn Arg Val Ser Cys Leu Gly Val
305 310 315 320

Thr Ala Asp Gly Met Ala Val Ala Thr Gly Ser Trp Asp Ser Phe Leu
325 330 335

Lys Ile Trp Asn
340

<210> 39 

<211> 2443 

<212> DNA 

<213> Homo sapiens

<220>
<221> CDS

<222> (162)...(1253)

<223> Nucleotide sequence encoding angiotensin receptor 
2 (AGTR2) 


<400> 39 

acgtcccagc gtctgagaga acgagtaagc aagaattcaa agcattctgc agcctgaatt 60
ttgaaggagt gtgtttaggc actaagcaag ctgatttatg ataactgctt taaacttcaa 120
caaccaaagg cataagaact aggagctgct gacatttcaa t atg aag ggc aac tcc 176
Met Lys Gly Asn Ser
1 5

acc ctt gcc act act agc aaa aac att acc agc ggt ctt cac ttc ggg 224
Thr Leu Ala Thr Thr Ser Lys Asn Ile Thr Ser Gly Leu His Phe Gly
10 15 20

ctt gtg aac atc tct ggc aac aat gag tct acc ttg aac tgt tca cag 272
Leu Val Asn Ile Ser Gly Asn Asn Glu Ser Thr Leu Asn Cys Ser Gln
25 30 35

aaa cca tca gat aag cat tta gat gca att cct att ctt tac tac att 320
Lys Pro Ser Asp Lys His Leu Asp Ala Ile Pro Ile Leu Tyr Tyr Ile
40 45 50

ata ttt gta att gga ttt ctg gtc aat att gtc gtg gtt aca ctg ttt 368
Ile Phe Val Ile Gly Phe Leu Val Asn Ile Val Val Val Thr Leu Phe
55 60 65

tgt tgt caa aag ggt cct aaa aag gtt tct agc ata tac atc ttc aac 416
Cys Cys Gln Lys Gly Pro Lys Lys Val Ser Ser Ile Tyr Ile Phe Asn
70 75 80 85

ctc gct gtg gct gat tta ctc ctt ttg gct act ctt cct cta tgg gca 464
Leu Ala Val Ala Asp Leu Leu Leu Leu Ala Thr Leu Pro Leu Trp Ala
90 95 100

acc tat tat tct tat aga tat gac tgg ctc ttt gga cct gtg atg tgc 512
Thr Tyr Tyr Ser Tyr Arg Tyr Asp Trp Leu Phe Gly Pro Val Met Cys
105 110 115

aaa gtt ttt ggt tct ttt ctt acc ctg aac atg ttt gca agc att ttt 560
Lys Val Phe Gly Ser Phe Leu Thr Leu Asn Met Phe Ala Ser Ile Phe
120 125 130

ttt atc acc tgc atg agt gtt gat agg tac caa tct gtc atc tac ccc 608
Phe Ile Thr Cys Met Ser Val Asp Arg Tyr Gln Ser Val Ile Tyr Pro
135 140 145

ttt ctg tct caa aga aga aat ccc tgg caa gca tct tat ata gtt ccc 656


WO 02/072604


PCT/US02/06728 


-83- 


Phe Leu Ser Gln Arg Arg Asn Pro Trp Gln Ala Ser Tyr Ile Val Pro
150 155 160 165

ctt gtt tgg tgt atg gcc tgt ttg tcc tca ttg cca aca ttt tat ttt 704
Leu Val Trp Cys Met Ala Cys Leu Ser Ser Leu Pro Thr Phe Tyr Phe
170 175 180

cga gac gtc aga acc att gaa tac tta gga gtg aat gct tgc att atg 752
Arg Asp Val Arg Thr Ile Glu Tyr Leu Gly Val Asn Ala Cys Ile Met
185 190 195

gct ttc cca cct gag aaa tat gcc caa tgg tca gct ggg att gcc tta 800
Ala Phe Pro Pro Glu Lys Tyr Ala Gln Trp Ser Ala Gly Ile Ala Leu
200 205 210

atg aaa aat atc ctt ggt ttt att atc cct tta ata ttc ata gca aca 848
Met Lys Asn Ile Leu Gly Phe Ile Ile Pro Leu Ile Phe Ile Ala Thr
215 220 225

tgc tat ttt gga att aga aaa cac tta ctg aag acg aat agc tat ggg 896
Cys Tyr Phe Gly Ile Arg Lys His Leu Leu Lys Thr Asn Ser Tyr Gly
230 235 240 245

aag aac agg ata acc cgt gac caa gtc ctg aag atg gca gct gct gtt 944
Lys Asn Arg Ile Thr Arg Asp Gln Val Leu Lys Met Ala Ala Ala Val
250 255 260

gtt ctg gcc ttc atc att tgg tgc ctt ccc ttc cat gtt ctg acc ttc 992
Val Leu Ala Phe Ile Ile Trp Cys Leu Pro Phe His Val Leu Thr Phe
265 270 275

ctg gat gct ctg gcc tgg atg ggt gtc att aat agc tgc gaa gtt ata 1040
Leu Asp Ala Leu Ala Trp Met Gly Val Ile Asn Ser Cys Glu Val Ile
280 285 290

gca gtc att gac ctg gca ctt cct ttt gcc atc ctc ttg gga ttc acc 1088
Ala Val Ile Asp Leu Ala Leu Pro Phe Ala Ile Leu Leu Gly Phe Thr
295 300 305

aac agc tgc gtt aat ccg ttt ctg tat tgt ttt gtt gga aac cgg ttc 1136
Asn Ser Cys Val Asn Pro Phe Leu Tyr Cys Phe Val Gly Asn Arg Phe
310 315 320 325

caa cag aag ctc cgc agt gtg ttt agg gtt cca att act tgg ctc caa 1184
Gln Gln Lys Leu Arg Ser Val Phe Arg Val Pro Ile Thr Trp Leu Gln
330 335 340

ggg aaa aga gag agt atg tct tgc cgg aaa agc agt tct ctt aga gaa 1232
Gly Lys Arg Glu Ser Met Ser Cys Arg Lys Ser Ser Ser Leu Arg Glu
345 350 355

atg gag acc ttt gtg tct taa acggagagca aaatgcatgt aatcaacatg 1283
Met Glu Thr Phe Val Ser *
360

gctacttgct ttgaggctca ccagaattat ttttaagtgg ttttaataaa ataataaaat 1343

ttcccctaat cttttctgaa tcttctgaaa ccaaatgtaa ctatgtttat cgtccagtga 1403

ctttcaggaa tgcccattgt tttctgatat gtttgtacaa gatttcattg gtgagacata 1463

tttacaacct agaagtaact ggtgatatat ctcaaattgt aattaataat agattgtgaa 1523

taatgatttg gggattcaga tttctctttg aaacatgctt gtgtttctta gtggggtttt 1583

atatccattt ttatcaggat ttcctcttga accagaacca gtctttcaac tcattgcatc 1643

atttacaaga caacattgta agagagatga gcacttctaa gttgagtata ttataataga 1703

ttagtactgg attattcagg ctttaggcat atgcttcttt aaaaacgcta taaattatat 1763

tcctcttgca tttcacttga gtggaggttt atagttaatc tataactaca tattgaatag 1823

ggctaggaat atagattaaa tcatactcct atgctttagc ttatttttac agttatagaa 1883

agcaagatgt actataacat agaattgcaa tctataatat ttgtgtgttc actaaactct 1943

gaataagcac tttttaaaaa actttctact cattttaatg attgtttaaa ggtttctatt 2003

ttctctgata cttttttgaa atcagtaaac actgtgtatt gttgtaaaat gtaaaggtca 2063

cttttcacat ccttgacttt ttagatgtgc tgctttgata tataggacat tgatttgatt 2123

tttattatta atgctttggt tctgggttgt ttcctaaaat atctgggtgg cttaaaaaaa 2183

actctttaac ttgtaataaa cccttaactg gcataggaaa tggtatccag aatggaattt 2243

tgctacatgg ggtctgggtg ggggcaaaga gacccagtca attacatgtt tggtaccaag 2303

aaaggaacct gtcagggcag tacaatgtga ctttgaaaat atataccgtg ggggtagttt 2363

taccctatat ctataaacac tgtttgttcc agaatctgta tgattctatg gagctatttt 2423

aaaccaattg caggtctaga 2443


<210> 40 

<211> 363 

<212> PRT 

<213> Homo sapiens

<400> 40

Met Lys Gly Asn Ser Thr Leu Ala Thr Thr Ser Lys Asn Ile Thr Ser
1 5 10 15

Gly Leu His Phe Gly Leu Val Asn Ile Ser Gly Asn Asn Glu Ser Thr
20 25 30

Leu Asn Cys Ser Gln Lys Pro Ser Asp Lys His Leu Asp Ala Ile Pro
35 40 45

Ile Leu Tyr Tyr Ile Ile Phe Val Ile Gly Phe Leu Val Asn Ile Val
50 55 60

Val Val Thr Leu Phe Cys Cys Gln Lys Gly Pro Lys Lys Val Ser Ser
65 70 75 80

Ile Tyr Ile Phe Asn Leu Ala Val Ala Asp Leu Leu Leu Leu Ala Thr
85 90 95

Leu Pro Leu Trp Ala Thr Tyr Tyr Ser Tyr Arg Tyr Asp Trp Leu Phe
100 105 110

Gly Pro Val Met Cys Lys Val Phe Gly Ser Phe Leu Thr Leu Asn Met
115 120 125

Phe Ala Ser Ile Phe Phe Ile Thr Cys Met Ser Val Asp Arg Tyr Gln
130 135 140

Ser Val Ile Tyr Pro Phe Leu Ser Gln Arg Arg Asn Pro Trp Gln Ala
145 150 155 160

Ser Tyr Ile Val Pro Leu Val Trp Cys Met Ala Cys Leu Ser Ser Leu
165 170 175

Pro Thr Phe Tyr Phe Arg Asp Val Arg Thr Ile Glu Tyr Leu Gly Val
180 185 190

Asn Ala Cys Ile Met Ala Phe Pro Pro Glu Lys Tyr Ala Gln Trp Ser
195 200 205

Ala Gly Ile Ala Leu Met Lys Asn Ile Leu Gly Phe Ile Ile Pro Leu
210 215 220

Ile Phe Ile Ala Thr Cys Tyr Phe Gly Ile Arg Lys His Leu Leu Lys
225 230 235 240

Thr Asn Ser Tyr Gly Lys Asn Arg Ile Thr Arg Asp Gln Val Leu Lys
245 250 255

Met Ala Ala Ala Val Val Leu Ala Phe Ile Ile Trp Cys Leu Pro Phe
260 265 270

His Val Leu Thr Phe Leu Asp Ala Leu Ala Trp Met Gly Val Ile Asn
275 280 285

Ser Cys Glu Val Ile Ala Val Ile Asp Leu Ala Leu Pro Phe Ala Ile
290 295 300

Leu Leu Gly Phe Thr Asn Ser Cys Val Asn Pro Phe Leu Tyr Cys Phe
305 310 315 320

Val Gly Asn Arg Phe Gln Gln Lys Leu Arg Ser Val Phe Arg Val Pro
325 330 335

Ile Thr Trp Leu Gln Gly Lys Arg Glu Ser Met Ser Cys Arg Lys Ser
340 345 350

Ser Ser Leu Arg Glu Met Glu Thr Phe Val Ser
355 360

<210> 41 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 41 

actgcctgat aaccatgctg 20 

<210> 42 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 42 

atacttacac accaggaggg 20 

<210> 43 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 43 

atgcctgctc caaaggcac 19 

<210> 44 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 44 

atgcctgctc caaaggcacc 20 


<210> 45 
<211> 21
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 45 

atgcctgctc caaaggcaca t 21 

<210> 46 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 46 

tacttctggt tctctgagcg 20 

<210> 47 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 47

actcaccttg aactcgtctc 20 

<210> 48 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 48 

tggttctctg agcgagtctt 20 

<210> 49 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 49 

tggttctctg agcgagtctt c 21 

<210> 50 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220>

<223> Primer 
<400> 50 

tggttctctg agcgagtctt tc 22 

<210> 51 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 51 

tgcagatgga ctttggcttc 20 

<210> 52 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 52 

tgcttgcctt ctgctacaag 20 

<210> 53 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 53 

cttccctgag cacctgctg 19 

<210> 54 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 54 

cttccctgag cacctgctgg t 21 

<210> 55 
<211> 20 
<212> DNA 

<213> Artificial Sequence 


<220> 

<223> Primer 
<400> 55

cttccctgag cacctgctga 20 

<210> 56 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 56 

aacagctcag gacgaaactg 20 

<210> 57 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 57 

agaaggagtt gaccttgtcc 20 


<210> 58
<211> 19
<212> DNA

<213> Artificial Sequence
<220>

<223> Primer
<400> 58

ggaagctcaa gtggccttc 19

<210> 59 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 59 

ggaagctcaa gtggccttcc 20 

<210> 60 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 


<400> 60 

ggaagctcaa gtggccttca ac 22

<210> 61
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 61 

aagtcactgg cagagctgg 19 

<210> 62 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 62 

gcaccagggc tttgttgaag 20 

<210> 63 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 63 

ttttccccgt agggctcca 19 

<210> 64 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 64 

ttttccccgt agggctccac 20 

<210> 65 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 65 

ttttccccgt agggctccag c 21 


<210> 66
<211> 20
<212> DNA

<213> Artificial Sequence
<220>

<223> Primer
<400> 66

tgcagaagtc actggcagag 20


<210> 67 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 67 

gttgaagttt tccccgtagg 20 

<210> 68 
<211> 19 
<212> DNA 

<213> Artificial Sequence
<400> 68 

actcctccac ctgctggtc 19 

<210> 69 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 69 

actcctccac ctgctggtcc 20 

<210> 70 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 70 

actcctccac ctgctggtct a 21 

<210> 71 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 


<400> 71 
aggacgtgcg tggcaacctg 20

<210> 72 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 72 

agctctgcca gtgacttctg 20 

<210> 73 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 73 

gtgacttctg cagcccctc 19 

<210> 74 
<211> 20 
<212> DNA 

<213> Artificial Sequence
<400> 74 

gtgacttctg cagcccctca 20 

<210> 75 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 75 

gtgacttctg cagcccctcg gt 22 

<210> 76 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 76 

cctgaccttc cagatgaag 19 

<210> 77 
<211> 19 
<212> DNA 

<213> Artificial Sequence 




<220> 

<223> Primer 
<400> 77 

tcaggttgcc acgcacgtc 19 

<210> 78 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 78 

caggatctcg gccagtgc 18 

<210> 79 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 79 

caggatctcg gccagtgcc 19 

<210> 80 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 80 

caggatctcg gccagtgctg 20 

<210> 81 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 81 

acctgcgaga gcttcagcag 20 

<210> 82 
<211> 20 
<212> DNA 

<213> Artificial Sequence 


<220> 

<223> Primer 




<400> 82 

tctccatgcg ctgtgcgtag 20 

<210> 83 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 83 

agctgcgcac ccaggtca 18 


<210> 84
<211> 19
<212> DNA

<213> Artificial Sequence
<220>

<223> Primer
<400> 84

agctgcgcac ccaggtcaa 19

<210> 85
<211> 20
<212> DNA

<213> Artificial Sequence
<220>

<223> Primer
<400> 85

agctgcgcac ccaggtcagc 20

<210> 86
<211> 19
<212> DNA

<213> Artificial Sequence
<220>

<223> Primer
<400> 86

tgtccaagga gctgcaggc 19

<210> 87
<211> 20
<212> DNA

<213> Artificial Sequence
<220>

<223> Primer
<400> 87

cttacgcagc ttgcgcaggt 20




<210> 88 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 88 

gcggacatgg aggacgtg 18 

<210> 89 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 89 

gcggacatgg aggacgtgc 19 

<210> 90 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 90 

gcggacatgg aggacgtgtg 20

<210> 91 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 91 

gttgtagaaa gaaccgctgc 20 

<210> 92 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 92 

gagaacgagt cttcaggtac 20 

<210> 93 
<211> 21 
<212> DNA 




<213> Artificial Sequence
<220>

<223> Primer
<400> 93

acaatctggg ctatgagatc a 21

<210> 94
<211> 22
<212> DNA

<213> Artificial Sequence
<220>

<223> Primer
<400> 94

acaatctggg ctatgagatc aa 22

<210> 95 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 95 

acaatctggg ctatgagatc agt 23 


<210> 96
<211> 20
<212> DNA

<213> Artificial Sequence
<220>

<223> Primer
<400> 96

cactctacac tgcatgtctc 20

<210> 97
<211> 20
<212> DNA

<213> Artificial Sequence
<220>

<223> Primer
<400> 97

acccttctga aaaggagagg 20

<210> 98
<211> 20
<212> DNA

<213> Artificial Sequence
<220>




<223> Primer 
<400> 98 

gaggagagac aaggcagata 20

<210> 99 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 99 

gaggagagac aaggcagata t 21 

<210> 100 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 100 

gaggagagac aaggcagata gt 22 


<210> 101
<211> 20
<212> DNA

<213> Artificial Sequence
<220>

<223> Primer
<400> 101

aaaggttcag ttgctgctgc 20

<210> 102
<211> 20
<212> DNA

<213> Artificial Sequence
<220>

<223> Primer
<400> 102

gctggggaag gtctaataac 20

<210> 103
<211> 19
<212> DNA

<213> Artificial Sequence
<220>

<223> Primer
<400> 103




gttgctgctg cctcgaatc 19 

<210> 104 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 104 

gttgctgctg cctcgaatcc 20 

<210> 105 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 105 

gttgctgctg cctcgaatct g 21 

<210> 106 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 106 

cgtctttctc cagatgatgc 20 

<210> 107 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 107 

agtgtcctat gggctgtttg 20 

<210> 108 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 108 

ggatgccatt cataccttta c 21 


<210> 109 




<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 109 

ggatgccatt cataccttta cc 22 


<210> 110
<211> 23
<212> DNA

<213> Artificial Sequence
<220>

<223> Primer
<400> 110

ggatgccatt cataccttta cgc 23

<210> 111 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 111 

tgggaaaaca gtgcagtgtg 20 

<210> 112 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 112 

tgatcgtctt cagaacgagg 20 

<210> 113 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 113 

ccagaccatc atcatcccat gga 23 

<210> 114 
<211> 21 
<212> DNA 

<213> Artificial Sequence 




<220> 

<223> Primer 
<400> 114 

ccagaccatc atcccatgga a 21 


<210> 115
<211> 22
<212> DNA

<213> Artificial Sequence
<220>

<223> Primer
<400> 115

ccagaccatc atcccatgga gc 22


<210> 116
<211> 20
<212> DNA

<213> Artificial Sequence
<220>

<223> Primer
<400> 116

cagcaatcgt ctttctccag 20

<210> 117
<211> 20
<212> DNA

<213> Artificial Sequence
<220>

<223> Primer
<400> 117

tcctatgggc tgtttgatgc 20

<210> 118
<211> 21
<212> DNA

<213> Artificial Sequence
<220>

<223> Primer
<400> 118

gtctttctcc agatgatgcc a 21

<210> 119 
<211> 22 
<212> DNA 

<213> Artificial Sequence 


<220> 

<223> Primer 




<400> 119 

gtctttctcc agatgatgcc aa 22 

<210> 120 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 120 

gtctttctcc agatgatgcc agt 23 

<210> 121 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 121 

agcggataac aatttcacac agg 23 

<210> 122 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 


<400> 122 
ggcgcacgcc tccacg 16


